SEQUENCE LISTING



(1) GENERAL INFORMATION: 



(i) APPLICANT: Inouye, Sumiko 
Hsu, Mei-Yin 
Eagle, Susan 
Inouye, Masayori 

(ii) TITLE OF INVENTION: Prokaryotic Reverse Transcriptase 
(iii) NUMBER OF SEQUENCES: 42 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Weiser & Associates 

(B) STREET: 230 South Fifteenth 

(C) CITY: Philadelphia 

(D) STATE: Pennsylvania 
(E) COUNTRY: U.S.A.

(F) ZIP: 19102 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: US 08/269,118 

(B) FILING DATE: 10-JUN-1994

(C) CLASSIFICATION: 7 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Weiser, Gerard J. 

(B) REGISTRATION NUMBER: 19,763 

(C) REFERENCE/DOCKET NUMBER: 377.5888P




(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 215/875-8383

(B) TELEFAX: 215-875-8394

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2176 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 640.. 2094 
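As a quick consistency check on the feature table, the CDS span can be converted to a codon count. The sketch below assumes the location uses 1-based, inclusive coordinates; the helper name `cds_codons` is illustrative and is not part of the listing.

```python
def cds_codons(location: str) -> int:
    """Return the number of complete codons in a 'start..end' CDS location."""
    # Tolerate stray spaces such as "640.. 2094".
    start_text, end_text = location.replace(" ", "").split("..")
    start, end = int(start_text), int(end_text)
    length = end - start + 1  # assumption: coordinates are 1-based and inclusive
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    return length // 3


print(cds_codons("640..2094"))  # 1455 bases -> 485 codons
```

Under that assumption the coding region of SEQ ID NO:1 spans 485 codons, which agrees with the residue numbering of the translation shown under the sequence.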



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
TCATCCGCGC GGACACCCCC TCCTACGTGC CCCCCGACGC GGAGAGCGGC GTGGAGACGG 60
TCTACCGCGT TT^:CCTGGAT GGTCACCTGG TGGCGGTGGA GTGGGGCCCG CGCACGGGCT 120
CGCCGCGTCA CCAGCGGCTC TGGTTCGACT CGGATGCGGA AGCCCCCGGA GCCTACTTCG 180



CGCGCCTCGA GAAGTTGGCG GCTGACGGCT ACATCGACGC GGCCTCGGCA TTGGTCT 

CCCTTCAACC ACGGCTCGGC CGCCACGCGC GGCCGGCAGG ACAGGTGCGA CGAACAGA<^ 

ACGACGTGCG CTTCACGCGC GAGCAGCCGA GAGAGGTCCG GAGTGCATCA GCCTGA<k:GC 

CTCGAGCGGC GGAGCGGCGT TGCGCCGCTC CGGTTGGAAT GCAGGACACT CTCC^AAGG 

TAGCCTGTTC TTGGCTCTCT CCCTCCTAGG CACTACGGCC AGGGTGGGTA GG^AGCCAA 

CGACGCCACC GCCGTTTACC CACCCCGGCC GTAGTGCCTA GGAGGGGAGA/feCCGGTGAGG 

CTACCGTGCC CCAGGTAAGA TGGTGGTGCT TTCCCGGCCT CCGTCGACTG CTCGCGCCAT 

GTCCCGTCTT CCATCGCCGC GCCCGCCCAA GGTGCAGAC ATG ACC GCC AGG CTG
Met Thr Ala Arg Leu
1 5




GAC CCG TTC GTC CCC GCA GCT TCG CCG CAG
Asp Pro Phe Val Pro Ala Ala Ser Pro Gln
10 15

CTC ACC GCT CCG TCG TCA GAC GCG GCC GCG
Leu Thr Ala Pro Ser Ser Asp Ala Ala Ala
25 30




ACG CCC GAG 
Thr Pro Glu 
20 

GCC CGC CGG 
Ala Arg Arg 
35 



CTC GCG CAC GAA GCG TTG CTC GTC CGC GCG AAG GCC ATC GAC GAA GCG
Leu Ala His Glu Ala Leu Leu Val Arg Ala Lys Ala Ile Asp Glu Ala
40 45 50

GGC GGC GCC GAC GAC TGG GTG CAG GCG CAG CTC GTC TCC AAG GGG CTC
Gly Gly Ala Asp Asp Trp Val Gln Ala Gln Leu Val Ser Lys Gly Leu
55 60 65

GCG GTC GAG GAC CTG GAC TTC TCC AGC GCC TCC GAG AAG GAC AAG AAG
Ala Val Glu Asp Leu Asp Phe Ser Ser Ala Ser Glu Lys Asp Lys Lys
75 80 85

GCC TGG AAG GAG AAG AAG AAG GCC GAG GCC ACC GAG CGC CGC GCG CTG
Ala Trp Lys Glu Lys Lys Lys Ala Glu Ala Thr Glu Arg Arg Ala Leu
90 95 100

AAG CGT CAG GCG CAC GAG GCfe TGG AAG GCC ACG CAC GTG GGC CAC CTG 
Lys Arg Gln Ala His Glu Ala Trp Lys Ala Thr His Val Gly His Leu
105 110 115

GGC GCG GGC GTG CAC TGG GCG GAG GAC CGC CTG GCC GAC GCG TTC GAC
Gly Ala Gly Val His Trp Ala Glu Asp Arg Leu Ala Asp Ala Phe Asp
120 125 130



GTG CCC CAC CGC GAG GAG CGC GCC CGG GCC AAC GGC CTG ACG GAG CTG
Val Pro His Arg Glu Glu Arg Ala Arg Ala Asn Gly Leu Thr Glu Leu
135 140

GAC TCC GCG GAG GCG CTG GCC AAG GCG CTG GGG CTG AGC GTC TCC AAG
Asp Ser Ala Glu Ala Leu Ala Lys Ala Leu Gly Leu Ser Val Ser Lys
155 160 165

CTC CGC TGG TTC GCG TTC CAC CGG GAG GTC GAC ACG GCC ACG CAC TAC
Leu Arg Trp Phe Ala Phe His Arg Glu Val Asp Thr Ala Thr His Tyr
170 175 180



240
1038
1134



1182 



/ 



185 



190 



200 



205 



210 



215 



220 



225 



ACG 
Thr 
195 


ATT 
He 


ACG 
Thr 


CTG 
Leu 


TCC 
Ser 


AA^ 

KBVi 


TTC 
Phe 


GJQ 
Val 


GCG 
Ala 


GCfG 


GAC 


GTC 
Val 
245 



GGA CGC TCC ATC CTC ACC AAC GCG CTG GCC CAC CAG GGC
Gly Arg Ser Ile Leu Thr Asn Ala Leu Ala His Gln Gly
230 235 240

GTG GTC AAG GTG GAC CTC AAG GAC TTC TTC CCC TCC GTC ACC TGG CGC
Val Val Lys Val Asp Leu Lys Asp Phe Phe Pro Ser Val Thr Trp Arg
250 255 260



CGG GTG AAG GGC CTG TTG CGC AAG GGC 
Arg Val Lys Gly Leu Leu Arg Lys Gly 
265 270 

ACG CTG CTG TCC CTC CTC TCC ACG GAA 
Thr Leu Leu Ser Leu Leu Ser Thr Glu 
280 285 



GGC CTG CGTG/GAG 
Gly Leu Arg Glu

GGC ACG TCC
Gly Thr Ser

GCG CCG CGG GAG GCG GTC CAG
Ala Pro Arg Glu Ala Val Gln
290



TTC CGC GGC AAG CTC CTG CAC GTC GCC AAG GGC CCG CGC GCC CTG CCC 
Phe Arg Gly Lys Leu Leu His Val Ala Lys Gly Pro Arg Ala Leu Pro
295 300 305



CAG GGC GCC CCC ACG TCG CCC GGC ... 
Gln Gly Ala Pro Thr Ser Pro Gly Ile
310 315



ACC AAC GCG CTC TGC CTG AAG 
Thr Asn Ala Leu Cys Leu Lys 
320 325 



CTC GAC AAG CGG CTG TCC GCC CTC GCG AAG CGG CTG GGC TTC ACC TAC
Leu Asp Lys Arg Leu Ser Ala Leu Ala Lys Arg Leu Gly Phe Thr Tyr
330 335 340

ACG CGC TAC GCG GAC GAC CTG ACC TTC TCC TGG ACG AAG GCG AAG CAG
Thr Arg Tyr Ala Asp Asp Leu Thr Phe Ser Trp Thr Lys Ala Lys Gln
345 350 355



CCC AAG CCG CGG CGG ACG CAG CGT CCC CCC GTC GCG GTC CTC CTG TCT
Pro Lys Pro Arg Arg Thr Gln Arg Pro Pro Val Ala Val Leu Leu Ser
360 365 370



CGC GTC CAG GAA GTG GTG GAG GCG GAG GGC TTC CGC GTG CAC CCG GAC
Arg Val Gln Glu Val Val Glu Ala Glu Gly Phe Arg Val His Pro Asp
375 380 385

AAG ACG CGC GTC GCC CGC AAG GGC ACG CGG CAG CGG GTC ACC GGG CTC
Lys Thr Arg Val Ala Arg Lys Gly Thr Arg Gln Arg Val Thr Gly Leu
390 395 400 405

GTC GTG AAT GCG GCG GGC AAG GAC GCG CCC GCG GCC CGA GTC CCG CGC
Val Val Asn Ala Ala Gly Lys Asp Ala Pro Ala Ala Arg Val Pro Arg
415 420

GAC GTC GTC CGC CAG CTC CGC GCC GCC ATC CAC AAC CGG AAG AAG GGC
Asp Val Val Arg Gln Leu Arg Ala Ala Ile His Asn Arg Lys Lys Gly
425 430 435
1326



1374
1470



1518 



1566 



1614 



1662 



1710 



1758 



1806 



1854 



1902 



1950 



if 



AAG CCG GGC CGC GAG GGC GAG TCG CTC GAG CAG
Lys Pro Gly Arg Glu Gly Glu Ser Leu Glu Gln
440 445

GCC TTC ATC CAC ATG ACG GAC CCG GCC AAG GGC
Ala Phe Ile His Met Thr Asp Pro Ala Lys Gly
455 460

CAG CTC ACG GAG CTC GAG TCC ACG GCG AGC GCC
Gln Leu Thr Glu Leu Glu Ser Thr Ala Ser Ala
470 475 480

CTC AAG
Leu Lys
450

CGC GCC
Arg Ala
465

ATTCAGCAA

1998
2046
2094

TGACGCTCAG CGCGCGTCCG TCGCCGACGT GCCGCGCGCC 2154
CTCCGTCAGC CGGCGCGGGT AC 2176

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val Lys Gln Trp Pro
1 5 10 15

Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile Cys Thr Glu Met
20 25 30

Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro Glu Asn Pro Tyr Asn
35 40 45

Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys
50 55 60

Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln Asp Phe Trp Glu
65 70 75 80

Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys Lys Lys Lys Ser
85 90 95

Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp
100 105 110

Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro Ser Ile Asn Asn
115 120 125

Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu Pro Gln Gly Trp
130 135 140

Lys Gly Ser Pro Ala Ile Phe Gln Ser Ser Met Thr Lys Ile Leu Glu
145 150 155 160

Pro Phe Lys Lys Gln Asn Pro Asp Ile Val Ile Tyr Gln Tyr Met Asp
165 170 175

Asp Leu Tyr Val Gly Ser Asp Leu Glu Ile Gly Gln His Arg Thr Lys
180 185 190

Ile Glu Glu Leu Arg Gln His Leu Leu Arg Trp Gly Leu Thr Thr Pro
195 200 205

Asp Lys Lys His Gln Lys Glu Pro Pro Phe Leu Trp Met Gly Tyr Glu
210 215 220

Leu His Pro Asp Lys Trp Thr Val Gln Pro Ile Val Leu Pro Glu Lys
225 230 235 240

Asp Ser Trp Thr Val Asn Asp Ile Gln Lys Leu Val Gly Lys Leu Asn
245 250 255

Trp Ala Ser Gln Ile Tyr Pro
260

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Arg Pro Trp Ala Arg Thr Pro Pro Lys Ala Pro Arg Asn Gln Pro Val
1 5 10 15

Pro Phe Lys Pro Glu Arg Leu Gln Ala Leu Gln His Leu Val Arg Lys
20 25 30

Ala Leu Glu Ala Gly His Ile Glu Pro Tyr Thr Gly Pro Gly Asn Asn
35 40 45

Pro Val Phe Pro Val Lys Lys Ala Asn Gly Thr Trp Arg Phe Ile His
50 55 60

Asp Leu Arg Ala Thr Asn Ser Leu Thr Ile Asp Leu Ser Ser Ser Ser
65 70 75 80

Pro Gly Pro Pro Asp Leu Ser Ser Leu Pro Thr Thr Leu Ala His Leu
85 90 95

Gln Thr Ile Asp Leu Arg Asp Ala Phe Phe Gln Ile Pro Leu Pro Lys
100 105 110

Gln Phe Gln Pro Tyr Phe Ala Phe Thr Val Pro Gln Gln Cys Asn Tyr
115 120 125

Gly Pro Gly Thr Arg Tyr Ala Trp Lys Val Leu Pro Gln Gly Phe Lys
130 135 140

Asn Ser Pro Thr Leu Phe Glu Met Gln Leu Ala His Ile Leu Gln Pro
145 150 155 160

Ile Arg Gln Ala Phe Pro Gln Cys Thr Ile Leu Gln Tyr Met Asp Asp
165 170 175

Ile Leu Leu Ala Ser Pro Ser His Glu Asp Leu Leu Leu Leu Ser Glu
180 185 190

Ala Thr Met Ala Ser Leu Ile Ser His Gly Leu Pro Val Ser Glu Asn
195 200 205

Lys Thr Gln Gln Thr Pro Gly Thr Ile Lys Phe Leu Gly Gln Ile Ile
210 215 220

Ser Pro Asn His Leu Thr Tyr Asp Ala Val Pro Thr Val Pro Ile Arg
225 230 235 240



Ser Arg Trp Ala Leu Pro Glu Leu Gln Ala Leu Leu Gly Glu Ile Gln
245 250 255

Trp Val Ser Lys Gly Thr Pro 
260 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Asn Val Leu Tyr Arg Ile Gly Ser Asp Asn Gln Tyr Thr Gln Phe Thr
1 5 10 15

Ile Pro Lys Lys Gly Lys Gly Val Arg Thr Ile Ser Ala Pro Thr Asp
20 25 30

Arg Leu Lys Asp Ile Gln Arg Arg Ile Cys Asp Leu Leu Ser Asp Cys
35 40 45

Arg Asp Glu Ile Phe Ala Ile Arg Lys Ile Ser Asn Asn Tyr Ser Phe
50 55 60

Gly Phe Glu Arg Gly Lys Ser Ile Ile Leu Asn Ala Tyr Lys His Arg
65 70 75 80

Gly Lys Gln Ile Ile Leu Asn Ile Asp Leu Lys Asp Phe Phe Glu Ser
85 90 95

Phe Asn Phe Gly Arg Val Arg Gly Tyr Phe Leu Ser Asn Gln Asp Phe
100 105 110

Leu Leu Asn Pro Val Val Ala Thr Thr Leu Ala Lys Ala Ala Cys Tyr
115 120 125

Asn Gly Thr Leu Pro Gln Gly Ser Pro Cys Ser Pro Ile Ile Ser Asn
130 135 140

Leu Ile Cys Asn Ile Met Asp Met Arg Leu Ala Lys Leu Ala Lys Lys
145 150 155 160

Tyr Gly Cys Thr Tyr Ser Arg Tyr Ala Asp Asp Ile Thr Ile Ser Thr
165 170 175

Asn Lys Asn Thr Phe Pro Leu Glu Met Ala Thr Val Gln Pro Glu Gly
180 185 190

Val Val Leu Gly Lys Val Leu Val Lys Glu Ile Glu Asn Ser Gly Phe
195 200 205

Glu Ile Asn Asp Ser Lys Thr Arg Leu Thr Tyr Lys Thr Ser Arg Gln
210 215 220

Glu Val Thr Gly Leu Thr Val Asn Arg Ile Val Asn Ile Asp Arg Cys
225 230 235 240

Tyr Tyr Lys Lys Thr Arg Ala Leu Ala His Ala Leu Tyr Arg Thr Gly
245 250 255

Glu Tyr Lys



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 266 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Ala Phe His Arg Glu Val Asp Thr Ala Thr His Tyr Val Ser Trp Thr
1 5 10 15

Ile Pro Lys Arg Asp Gly Ser Lys Arg Thr Ile Thr Ser Pro Lys Pro
20 25 30

Glu Leu Lys Ala Ala Gln Arg Trp Val Leu Ser Asn Val Val Glu Arg
35 40 45

Leu Pro Val His Gly Ala Ala His Gly Phe Val Ala Gly Arg Ser Ile
50 55 60

Leu Thr Asn Ala Leu Ala His Gln Gly Ala Asp Val Val Val Lys Val
65 70 75 80

Asp Leu Lys Asp Phe Phe Pro Ser Val Thr Trp Arg Arg Val Lys Gly
85 90 95

Leu Leu Arg Lys Gly Gly Leu Arg Glu Gly Thr Ser Thr Leu Leu Ser
100 105 110

Leu Leu Ser Thr Glu Ala Pro Arg Glu Ala Val Gln Phe Arg Gly Lys
115 120 125

Leu Leu His Val Ala Lys Gly Pro Arg Ala Leu Pro Gln Gly Ala Pro
130 135 140

Thr Ser Pro Gly Ile Thr Asn Ala Leu Cys Leu Lys Leu Asp Lys Arg
145 150 155 160

Leu Ser Ala Leu Ala Lys Arg Leu Gly Phe Thr Tyr Thr Arg Tyr Ala
165 170 175

Asp Asp Leu Thr Phe Ser Trp Thr Lys Ala Lys Gln Pro Lys Pro Arg
180 185 190

Arg Thr Gln Arg Pro Pro Val Ala Val Leu Leu Ser Arg Val Gln Glu
195 200 205

Val Val Glu Ala Glu Gly Phe Arg Val His Pro Asp Lys Thr Arg Val
210 215 220

Ala Arg Lys Gly Thr Arg Gln Arg Val Thr Gly Leu Val Val Asn Ala
225 230 235 240

Ala Gly Lys Asp Ala Pro Ala Ala Arg Val Pro Arg Asp Val Val Arg
245 250 255

Gln Leu Arg Ala Ala Ile His Asn Arg Lys
260 265

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Pro Thr Pro Glu Leu Thr Ala Pro Ser Ser Asp Ala Ala Ala Lys Arg
1 5 10 15

Glu Ala Arg Arg Leu Ala His Glu Ala Leu Leu Val Arg Ala Lys Ala
20 25 30

Ile Asp Glu Ala Gly Gly Ala Asp Asp Trp Val Gln Ala Gln Leu Val
35 40 45

Ser Lys Gly Leu Ala Val Glu Asp Leu Asp Phe Ser Ser Ala Ser Glu
50 55 60

Lys Asp Lys Lys Ala Trp Lys Glu Lys Lys Lys Ala Glu Ala Thr Glu
65 70 75 80

Arg Arg Ala Leu Lys Arg Gln Ala His Glu Ala Trp Lys Ala Thr His
85 90 95

Val Gly His Leu Gly Ala Gly Val His Trp Ala Glu Asp Arg Leu
100 105 110

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 110 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Pro Asp Pro Asp Met Thr Arg Val Thr Asn Ser Pro Ser Leu Gln Ala
1 5 10 15

His Leu Gln Ala Leu Tyr Leu Val Gln His Glu Val Trp Arg Pro Leu
20 25 30

Ala Ala Ala Tyr Gln Glu Gln Leu Asp Arg Pro Val Val Pro His Pro
35 40 45

Tyr Arg Val Gly Asp Thr Val Trp Val Arg Arg His Gln Thr Lys Asn
50 55 60

Leu Glu Pro Arg Trp Lys Gly Pro Tyr Thr Val Leu Leu Thr Thr Pro
65 70 75 80

Thr Ala Leu Lys Val Asp Gly Ile Ala Ala Trp Ile His Ala Ala His
85 90 95

Val Lys Ala Ala Asp Pro Gly Gly Gly Pro Ser Ser Arg Leu
100 105 110

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Gly Lys Asp Ala Pro Ala Ala Arg Val Pro Arg Asp Val Val Arg Gln
1 5 10 15

Leu Arg Ala Ala Ile His Asn Arg Lys Lys Gly Lys Pro Gly Arg Glu
20 25 30

Gly Glu Ser Leu Glu Gln Leu Lys Gly Met Ala Ala Phe Ile His Met
35 40 45

Thr Asp Pro Ala Lys Gly Arg Ala Phe Leu Ala Gln Leu Thr Glu Leu
50 55 60

Glu Ser Thr Ala Ser Ala Ala Pro Gln Ala Glu
65 70 75



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 66 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

Gly Lys Glu Gly His Ser Ala Arg Gln Cys Arg Ala Pro Arg Arg Gln
1 5 10 15

Gly Cys Trp Lys Cys Gly Lys Pro Gly His Ile Met Thr Asn Cys Pro
20 25 30

Asp Arg Gln Ala Gly Phe Leu Gly Leu Gly Pro Trp Gly Lys Lys Pro
35 40 45

Arg Asn Phe Pro Val Ala Gln Val Pro Gln Gly Leu Thr Pro Thr Ala
50 55

Pro Pro
65

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Gly Pro Arg Ala Leu Pro Gln Gly Ala Pro Thr Ser Pro Gly Ile Thr
1 5 10 15

Asn Ala Leu Cys Leu Lys Leu Asp Lys Arg Leu Ser Ala Leu Ala Lys
20 25 30

Arg Leu Gly Phe Thr Tyr Thr Arg Tyr Ala Asp Asp Leu Thr Phe Ser
35 40 45

Trp Thr Lys Ala Lys Gln Pro Lys Pro Arg Arg Thr Gln Arg Pro Pro
50 55 60

Val Ala Val Leu
65

(2) INFORMATION FOR SEQ ID NO: 11: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

Tyr Asn Gly Thr Leu Pro Gln Gly Ser Pro Cys Ser Pro Ile Ile Ser
1 5 10 15

Asn Leu Ile Cys Asn Ile Met Asp Met Arg Leu Ala Lys Leu Ala Lys
20 25 30

Lys Tyr Gly Cys Thr Tyr Ser Arg Tyr Ala Asp Asp Ile Thr Ile Ser
35 40 45

Thr Asn Lys Asn Thr Phe Pro Leu Glu Met Ala Thr Val Gln Pro Glu
50 55 60

Gly Val Val Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Tyr Lys Asn Leu Leu Pro Gln Gly Ala Pro Ser Ser Pro Lys Leu Ala
1 5 10 15

Asn Leu Ile Cys Ser Lys Leu Asp Tyr Arg Ile Gln Gly Tyr Ala Gly
20 25 30

Ser Arg Gly Leu Ile Tyr Thr Arg Tyr Ala Asp Asp Leu Thr Leu Ser
35 40 45

Ala Gln Ser Met Lys Lys Val Val Lys Ala Arg Asp Phe Leu Phe Ser
50 55 60

Ile Ile Pro Ser
65

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Tyr Gln Tyr Asn Val Leu Pro Gln Gly Trp Lys Gly Ser Pro Ala Ile
1 5 10 15

Phe Gln Ser Ser Met Thr Lys Ile Leu Glu Pro Phe Lys Lys Gln Asn
20 25 30

Pro Asp Ile Val Ile Tyr Gln Tyr Met Asp Asp Leu Tyr Val Gly Ser
35 40 45

Asp Leu Glu Ile Gly Gln His Arg Thr Lys Ile Glu Glu Leu Arg Gln
50 55 60

His Leu Leu
65



(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Tyr Ala Trp Lys Val Leu Pro Gln Gly Phe Lys Asn Ser Pro Thr Leu
1 5 10 15

Phe Glu Met Gln Leu Ala His Ile Leu Gln Pro Ile Arg Gln Ala Phe
20 25 30

Pro Gln Cys Thr Ile Leu Gln Tyr Met Asp Asp Ile Leu Leu Ala Ser
35 40 45

Pro Ser His Glu Asp Leu Leu Leu Leu Ser Glu Ala Thr Met Ala Ser
50 55 60

Leu Ile
65

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Leu Thr Trp Thr Arg Leu Pro Gln Gly Phe Lys Asn Ser Pro Thr Leu
1 5 10 15

Phe Asp Glu Ala Leu His Arg Asp Leu Ala Asp Phe Arg Ile Gln His
20 25 30

Pro Asp Leu Ile Leu Leu Gln Tyr Val Asp Asp Leu Leu Leu Ala Ala
35 40 45

Thr Ser Glu Leu Asp Cys Gln Gln Gly Thr Arg Ala Leu Leu Gln Thr
50 55 60

Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 65 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Phe Gln Trp Lys Val Leu Pro Gln Gly Met Thr Cys Ser Pro Thr Ile
1 5 10 15

Cys Gln Leu Val Val Gly Gln Val Leu Glu Pro Leu Arg Leu Lys His
20 25 30

Pro Ser Leu Cys Met Leu His Tyr Met Asp Asp Leu Leu Leu Ala Ala
35 40 45

Ser Ser His Asp Gly Leu Glu Ala Ala Gly Glu Glu Val Ile Ser Thr
50 55 60

Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Phe Ala Trp Arg Val Leu Pro Gln Gly Phe Ile Asn Ser Pro Ala Leu
1 5 10 15

Phe Glu Arg Ala Leu Gln Glu Pro Leu Arg Gln Val Ser Ala Ala Phe
20 25 30

Ser Gln Ser Leu Leu Val Ser Tyr Met Asp Asp Ile Leu Tyr Ala Ser
35 40 45

Pro Thr Glu Glu Gln Arg Ser Gln Cys Tyr Gln Ala Leu Ala Ala Arg
50 55 60

Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 61 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Ile Ala Thr Asn Gly Val Pro Gln Gly Ala Ser Thr Ser Cys Gly Leu
1 5 10 15

Ala Thr Tyr Asn Val Leu Glu Leu Phe Leu Arg Tyr
20 25

Met Tyr Ala Asp Asp Gly Ile Leu Cys Arg Gln Asp
35 40

Asp Phe Ser Val Glu Glu Ala Gly Val Val Gln Gl
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Tyr Glu Tyr Leu Arg Met Pro Phe Gly Leu Lys Asn Ala Pro Ala Thr
1 5 10 15

Phe Gln Arg Cys Met Asn Asp Ile Leu Arg Pro Leu Leu Asn Lys His
20 25 30

Cys Leu Val Tyr Leu Asp Asp Ile Ile Val Phe Ser Thr Ser Leu Asp
35 40 45

Glu His Leu Gln Ser Leu Gly Leu Val Phe Glu Lys Leu
50 55 60

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Tyr Glu Phe Cys Arg Leu Pro Phe Gly Leu Arg Asn Ala Ser Ser Ile
1 5 10 15

Phe Gln Arg Ala Leu Asp Asp Val Leu Arg Glu Gln Ile Gly Lys Ile
20 25 30

Cys Tyr Val Tyr Val Asp Asp Val Ile Ile Phe Ser Glu Asn Glu Ser
35 40 45

Asp His Val Arg His Ile Asp Thr Val Leu Lys Cys Leu
50 55 60



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:

Cys Lys Leu Asn Lys Ala Ile Tyr Gly Leu Lys Gln Ala Ala Arg Cys
1 5 10 15

Trp Phe Arg Cys Ile Tyr Ile Leu Asp Lys Gly Asn Ile Asn Glu Asn
20 25 30

Ile Tyr Val Leu Leu Tyr Val Asp Asp Val Val Ile Ala Thr Gly Asp
35 40 45

Met Thr Arg Met Asn Asn Phe Lys Arg Tyr Leu Met Glu Lys Phe
50 55 60

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Cys Leu Leu Lys Lys Ser Leu Tyr Gly Leu Lys Gln Ser Pro Arg Gln
1 5 10 15

Trp Asn Ala Cys Val Tyr Val Lys Gln Val Ser Glu Gln Glu His Leu
20 25 30

Tyr Leu Leu Leu Tyr Val Asp Asp Met Leu Ile Ala Gly Lys Ser Lys
35 40 45

Ser Glu Ile Asn Lys Val Lys Glu Gln Leu Ser Met Glu Phe
50 55 60

(2) INFORMATION FOR SEQ ID NO: 23: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Ile Arg Leu Lys Lys Ser Leu Tyr Glu Leu Lys Gln Ser Gly Ala Asn
1 5 10 15

Trp Tyr Glu Glu Val Arg Gly Trp Ser Cys Val Phe Lys Asn Ser Gln
20 25 30

Val Thr Ile Cys Leu Phe Val Asp Asp Met Val Leu Phe Ser Lys Asn
35 40 45

Leu Asn Ser Asn Lys Arg Ile Ile Glu Lys Leu Lys Met Gln Tyr
50 55 60

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /note= "The 2' position of this
nucleotide is linked to the 5' position of
nucleotide number 1 of SEQ ID NO: 25 of this
application."

(ix) FEATURE:

(A) NAME/KEY: misc_binding

(B) LOCATION: 52..58

(D) OTHER INFORMATION: /note= "This region can hydrogen
bond to nucleotides 61-67 of SEQ ID NO: 25 of this
application."

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:
CACGCAUGUA GGCAGAUUUG UUGGUUGUGA AUCGCAACCA GUGGCCUUAA UGGCAGGA 58
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1

(D) OTHER INFORMATION: /note= "The 5' position of this
nucleotide is linked to the 2' position of
nucleotide number 15 of SEQ ID NO: 24 of this
application."

(ix) FEATURE:

(A) NAME/KEY: misc_binding

(B) LOCATION: 61..67

(D) OTHER INFORMATION: /note= "This region can hydrogen
bond to nucleotides 52-58 of SEQ ID NO: 24 of this
application."

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
TCCTTCGCAC AGCACACCTG CCGTATAGCT CTGAATCAAG GATTTTAGGG AGGCGATTCC
TCCTGCC 
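The misc_binding features of SEQ ID NO: 24 and SEQ ID NO: 25 state that nucleotides 52-58 of the RNA can hydrogen bond to nucleotides 61-67 of the DNA. A minimal sketch of how that can be checked, assuming ordinary Watson-Crick pairing between antiparallel strands; the function and variable names are illustrative and not part of the listing.

```python
# Watson-Crick DNA partner for each RNA base (assumption: no wobble pairs).
RNA_TO_DNA_PARTNER = {"A": "T", "U": "A", "G": "C", "C": "G"}


def can_pair_antiparallel(rna: str, dna: str) -> bool:
    """True if every RNA base (5'->3') pairs with the DNA strand read 3'->5'."""
    if len(rna) != len(dna):
        return False
    return all(RNA_TO_DNA_PARTNER.get(r) == d for r, d in zip(rna, reversed(dna)))


rna_52_58 = "GGCAGGA"  # last seven bases of SEQ ID NO: 24 as printed
dna_61_67 = "TCCTGCC"  # last seven bases of SEQ ID NO: 25 as printed
print(can_pair_antiparallel(rna_52_58, dna_61_67))  # True
```

The two 3' ends are exact reverse complements, which is consistent with the base-paired region described in the feature notes.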

(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 418.. 2175 
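Each codon row in the CDS is printed above its three-letter translation, so a row can be checked mechanically. The sketch below assumes the standard genetic code; the names `CODON_TABLE`, `THREE_LETTER` and `translate_row` are illustrative, and the sample row is one codon line of this sequence (residues 97-112).

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val", "*": "End",
}


def translate_row(codons: str) -> str:
    """Translate a space-separated row of DNA codons to three-letter codes."""
    return " ".join(THREE_LETTER[CODON_TABLE[c]] for c in codons.upper().split())


row = "TTT GAG AGG GGA AAA TCA ATA ATC CTA AAT GCT TAT AAG CAT AGA GGC"
print(translate_row(row))
# Phe Glu Arg Gly Lys Ser Ile Ile Leu Asn Ala Tyr Lys His Arg Gly
```

A row whose translation disagrees with the printed residues (for example a Tyr printed under `TAG`) points to a character-recognition error in the codon rather than in the protein line.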



60 
67 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

TGGCCATTNA GATACGGATT TTCACTTCCT TGACAGTGCA TGACTATGCT GCATGAAATN 60

GCATGATCGA TTGAGGATCG TCTTTGCTCA GATCCGCCAG AACTGGCGGG CTTTTGCTCA 120

TGTCATGCAT GTGCATGAAA ACCACTGCAT AAA^CGGGCA GGCGTGGCGG GGATACGAGC 180

GCGCGCTATC ACCGAAAATA GCCAAAATAC TTTTGGAAAA CAGAAAGTTG AAGTGATATG 240

TTCATAAACA CGCATGTAGG CAGATTTGTT GGTTGTGAAT CGCAACCAGT GGCCTTAATG 300

GCAGGAGGAA TCGCCTCCCT AAAATCCTTG ATTCAGAGCT ATACGGCAGG TGTGCTGTGC 360

GAAGGAGTGC CTGCATGCGT TTCTCCT-jfeG CCTTTTTTCC TCTGGGATGA AGAAGAA 417

ATG ACA AAA ACA TCT AAA CTT GAC GCA CTT AGG GCT GCT ACT TCA CGT 465
Met Thr Lys Thr Ser Lys Leu Asp Ala Leu Arg Ala Ala Thr Ser Arg
1 5 10 15

GAA GAC TTG GCT AAA ATT 'A GAT ATT AAG TTG GTA TTT TTA ACT AAC 513
Glu Asp Leu Ala Lys Ile Leu Asp Ile Lys Leu Val Phe Leu Thr Asn
20 25 30

GTT CTA TAT AGA ATC GGC TCG GAT AAT CAA TAC ACT CAA TTT ACA ATA 561
Val Leu Tyr Arg Ile Gly Ser Asp Asn Gln Tyr Thr Gln Phe Thr Ile
35 40 45

CCG AAG AAA GGA AAA GGG GTA AGG ACT ATT TCT GCA CCT ACA GAC CGG 609
Pro Lys Lys Gly Lys Gly Val Arg Thr Ile Ser Ala Pro Thr Asp Arg
50 55 60

TTG AAG GAC ATC CAA CGA AGA ATA TGT GAC TTA CTT TCT GAT TGT AGA 657
Leu Lys Asp Ile Gln Arg Arg Ile Cys Asp Leu Leu Ser Asp Cys Arg
65 70 75 80



GAT GAG ATC TTT GCT ATA AGG AAA ATT AGT AAC AAC TAT TCC TTT GGT
Asp Glu Ile Phe Ala Ile Arg Lys Ile Ser Asn Asn Tyr Ser Phe Gly
85 90 95

TTT GAG AGG GGA AAA TCA ATA ATC CTA AAT GCT TAT AAG CAT AGA GGC
Phe Glu Arg Gly Lys Ser Ile Ile Leu Asn Ala Tyr Lys His Arg Gly
100 105 110

AAA CAA ATA ATA TTA AAT ATA GAT CTT AAG GAT TTT TTT GAA AGC TT
Lys Gln Ile Ile Leu Asn Ile Asp Leu Lys Asp Phe Phe Glu Ser Phe
115 120 125

AAT TTT GGA CGA GTT AGA GGA TAT TTT CTT TCC AAT CAG GAT TT;* TTA 849
Asn Phe Gly Arg Val Arg Gly Tyr Phe Leu Ser Asn Gln Asp Phe Leu
130 135 140

TTA AAT CCT GTG GTG GCA ACG ACA CTT GCA AAA GCT GCA TGG' TAT AAT 897
Leu Asn Pro Val Val Ala Thr Thr Leu Ala Lys Ala Ala Cys Tyr Asn
145 150 155 160

GGA ACC CTC CCC CAA GGA AGT CCA TGT TCT CCT ATT ATC TCA AAT CTA 945
Gly Thr Leu Pro Gln Gly Ser Pro Cys Ser Pro Ile Ile Ser Asn Leu
165 170 175

ATT TGC AAT ATT ATG GAT ATG AGA TTA GCT AAG CTG GCT AAA AAA TAT 993
Ile Cys Asn Ile Met Asp Met Arg Leu Ala Lys Leu Ala Lys Lys Tyr
180 185 190

GGA TGT ACT TAT AGC AGA TAT GCT GAT GAT ATA ACA ATT TCT ACA AAT 1041
Gly Cys Thr Tyr Ser Arg Tyr Ala Asp Asp Ile Thr Ile Ser Thr Asn
195 200 205



AAA AAT ACA TTT CCG TTA GAA ATG GCT ACT GTG CAA CCT GAA GGG GTT 1089
Lys Asn Thr Phe Pro Leu Glu Met Ala Thr Val Gln Pro Glu Gly Val
210 215 220

GTT TTG GGA AAA GTT TTG GTA AAA GAA ATA GAA AAC TCT GGA TTC GAA 1137
Val Leu Gly Lys Val Leu Val Lys Glu Ile Glu Asn Ser Gly Phe Glu
225 230 235 240

ATA AAT GAT TCA AAG ACT AGG CTT A' TAT AAG ACA TCA AGG CAA GAA 1185
Ile Asn Asp Ser Lys Thr Arg Leu Thr Tyr Lys Thr Ser Arg Gln Glu
245 250 255

GTA ACG GGA CTT ACA GTT AAC AGA ATC GTT AAT ATT GAT AGA TGT TAT 1233
Val Thr Gly Leu Thr Val Asn Arg Ile Val Asn Ile Asp Arg Cys Tyr
260 265 270

TAT AAA AAA ACT CGG GCG TTG GCA CAT GCT TTG TAT CGT ACA GGT GAA 1281
Tyr Lys Lys Thr Arg Ala Leu Ala His Ala Leu Tyr Arg Thr Gly Glu
275 280 285

TAT AAA GTG CCA GAT GAA AAT GGT GTT TTA GTT TCA GGA GGT CTG GAT 1329
Tyr Lys Val Pro Asp Glu Asn Gly Val Leu Val Ser Gly Gly Leu Asp
290 295 300

AAA CTT GAG GGG ATG TTT GGT TTT ATT GAT CAA GTT GAT AAG TTT AAC 1377
Lys Leu Glu Gly Met Phe Gly Phe Ile Asp Gln Val Asp Lys Phe Asn
305 310 315 320



AAT ATA AAG AAA AAA CTG AAC AAG CAA CCT GAT AGA TAT GTA TTG ACT 1425
Asn Ile Lys Lys Lys Leu Asn Lys Gln Pro Asp Arg Tyr Val Leu Thr
325 330 335

AAT GCG ACT TTG CAT GGT TTT AAA TTA AAG TTG AAT GCG CGA GAA AAA 1473
Asn Ala Thr Leu His Gly Phe Lys Leu Lys Leu Asn Ala Arg Glu Lys
340 345 350

GCA TAT AGT AAA TTT ATT TAC TAT AAA TTT TTT CAT GGC AAC ACC TGT
Ala Tyr Ser Lys Phe Ile Tyr Tyr Lys Phe Phe His Gly Asn Thr Cys
355 360 365

CCT ACG ATA ATT ACA GAA GGG AAG ACT GAT CGG ATA TAT TTG AAG GCT 1569
Pro Thr Ile Ile Thr Glu Gly Lys Thr Asp Arg Ile Tyr Leu Lys Ala
370 375 380

GCT TTG CAT TCT TTG GAG ACA TCA TAT CCT GAG TTG TTT AGA GAA 1617
Ala Leu His Ser Leu Glu Thr Ser Tyr Pro Glu Leu Phe Arg Glu Lys
385 390 395 400



ACA GAT AGT AAA AAG AAA GAA ATA AAT CTT AAT ATA TTT AAA TCT AAT
Thr Asp Ser Lys Lys Lys Glu Ile Asn Leu Asn Ile Phe Lys Ser Asn
405 410 415

GAA AAG ACC AAA TAT TTT TTA GAT CTT TCT GGG GGA ACT GCA GAT CTG
Glu Lys Thr Lys Tyr Phe Leu Asp Leu Ser Gly Gly Thr Ala Asp Leu
420 425 430



AAA AAA 
Lys Lys 



TCT GTT 
Ser Val 
450 



TTT GTA 
Phe Val 
435 

CCA AAA 
Pro Lys 



GAG CGT 
Glu Arg 



CAG CCA 
Gin Pro 



TAT AAA AAT AAT 
Tyr Lys Asn Asn 
440 

GTG ATT ATG GTT 
Val He Met Val 
455 



TAT 
Tyr 



CTT GA^ftAT 
Leu Asp Asn 
/460 




GCT TCT 
Ala /Ser 
'445 



TAT GGT 
Tyr Gly 



ACA GGT 
Thr Gly 



1665 



1713 



1761 



1809 



CCA AGC 
Pro Ser 
465 



GAT TTA 
Asp Leu 



CTT AAT 
Leu Asn 
470 



TTT CTG CGC AAT 
Phe Leu Arg Asn 



AAA GTT AAA AGC 
L^^s Val Lys Ser 



TGC CCA 
Cys Pro 
480 



1857 



GAC GAT GTA ACT 
Asp Asp Val Thr 



AAT TTA TAT ATA 
Asn Leu Tyr He 
500 

TCA ATG GAG GAT 
Ser Met Glu Asp 
515 



GAA ATG 
Glu Met 
485 

GTT CTC 
Val Leu 



CTT TTC 
Leu Phe 



AGA AAG 
Arg Lys 



ACA CCA 
Thr Pro 



CCT AAA 
Pro Lys 
520/ 



ATG AA?V TAT ATT CAT GTT TTC TAT 
Met l/s Tyr He His Val Phe Tyr 
4^90 495 



ttg/agt 

Leu Ser 

SQ^S 



CCT TCC GGC GAA CAA ACT 
Pro Ser Gly Glu Gin Thr 
510 



GAT ATT TTA GAT ATC AAG ATT GAT
Asp Ile Leu Asp Ile Lys Ile Asp
525 



1905 



1953 



2001 



GGT AAG AAA TTC AAC AAA AAT AAT GAT GGA GAC TCA AAA ACG GAA TAT 2049
Gly Lys Lys Phe Asn Lys Asn Asn Asp Gly Asp Ser Lys Thr Glu Tyr
530 535 540

GGG AAG CAT ATT TTT TCC ATG AGG GTT GTT AGA GAT AAA AAG CGG AAA 2097
Gly Lys His Ile Phe Ser Met Arg Val Val Arg Asp Lys Lys Arg Lys
545 550 555 560

ATA GAT TTT AAG GCA TTT TGT TGT ATT TTT GAT GCT ATA AAA GAT ATA 2145
Ile Asp Phe Lys Ala Phe Cys Cys Ile Phe Asp Ala Ile Lys Asp Ile
565 570 575

AAG GAA CAT TAT AAA TTA ATG TTA AAT AGC TAATGAACAG CCCTAACGTT 2195
Lys Glu His Tyr Lys Leu Met Leu Asn Ser
580 585



ATGAACGCTA AGGCTGATTT TTCGTTAAAA TTTATATGGT TTGAATTGTA ATATATTATC 
TTCAAGCCAT TTATTTAATT CCTGCATCCT TTTCTGTAAG GGTATTAATT CGTTCCTCAC 
AAACACTAAA CTCGCTTTTT CCACATCCCC AAACCCCCCT AACATTATTC GGCATAATCC 
CCATCATTTG CGGTGGCACA CGATGCGCTG CCATCATGTC ATCGCGGC 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 546 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val Lys Gln Trp Pro
1 5 10 15

Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile Cys Thr Glu Met
20 25 30

Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro Glu Asn Pro Tyr Asn
35 40 45

Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys
50 55 60

Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln Asp Phe Trp Glu
65 70 75 80

Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys Lys Lys Lys Ser
85 90 95

Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp
100 105 110

Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro Ser Ile Asn Asn
115 120 125

Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu Pro Gln Gly Trp
130 135 140

Lys Gly Ser Pro Ala Ile Phe Gln Ser Ser Met Thr Lys Ile Leu Glu
145 150 155 160

Pro Phe Lys Lys Gln Asn Pro Asp Ile Val Ile Tyr Gln Tyr Met Asp
165 170 175

Asp Leu Tyr Val Gly Ser Asp Leu Glu Ile Gly Gln His Arg Thr Lys
180 185 190

Ile Glu Glu Leu Arg Gln His Leu Leu Arg Trp Gly Leu Thr Thr Pro
195 200 205

Asp Lys Lys His Gln Lys Glu Pro Pro Phe Leu Trp Met Gly Tyr Glu
210 215 220

Leu His Pro Asp Lys Trp Thr Val Gln Pro Ile Val Leu Pro Glu Lys
225 230 235 240

Asp Ser Trp Thr Val Asn Asp Ile Gln Lys Leu Val Gly Lys Leu Asn
245 250 255

Trp Ala Ser Gln Ile Tyr Pro Gly Ile Lys Val Arg Gln Leu Cys Lys
260 265 270

Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val Ile Pro Leu Thr Glu
275 280 285

Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu Ile Leu Lys Glu Pro
290 295 300

Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu 
305 310 315 

Gln Lys Gln Gly Gln Gly Gln Trp Thr Tyr Gln II 
325 330 



Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Arg Mi 
340 345 



Thr Asn Asp Val Lys Gln Leu Thr Glu Ala Val 
355 360 




Ala His 



le Thr Thr 



Glu Ser Ile Val Ile Trp Gly Lys Thr Pro Lys Phe Lys Leu Pro Ile 
370 375 380 

Gln Lys Glu Thr Trp Glu Thr Trp Trp Thr Glu Tyr Trp Gln Ala Thr 
385 390 395 400 

Trp Ile Pro Glu Trp Glu Phe Val Asn Thr Pro Pro Leu Val Lys Leu 
405 410 415 

Trp Tyr Gln Leu Glu Lys Glu Pro Ile Val Gly Ala Glu Thr Phe Tyr 
420 425 430 

Val Asp Gly Ala Ala Asn Arg Glu Thr Lys Leu Gly Lys Ala Gly Tyr 
435 440 445 

Val Thr Asn Lys Gly Arg Gln Lys Val Val Pro Leu Thr Asn Thr Thr 
450 455 460 

Asn Gln Lys Thr Glu Leu Gln Ala Ile Tyr Leu Ala Leu Gln Asp Ser 
465 470 475 480 

Gly Leu Glu Val Asn Ile Val Thr Asp Ser Gln Tyr Ala Leu Gln Ile 
485 490 495 

Ile Gln Ala Gln Pro Asp Lys Ser Glu Ser Glu Leu Val Asn Gln Ile 
500 505 510 

Ile Glu Gln Leu Ile Lys Lys Glu Lys Val Tyr Leu Ala Trp Val Pro 
515 520 525 

Ala His Lys Gly Ile Gly Gly Asn Glu Gln Val Asp Lys Leu Val Ser 
530 535 540 

Ala Gly 
545 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 578 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Arg Pro Trp Ala Arg Thr Pro Pro Lys Ala Pro Arg Asn Gln Pro Val 
1 5 10 15 

Pro Phe Lys Pro Glu Arg Leu Gln Ala Leu Gln His Leu Val Arg Lys 
20 25 30 

Ala Leu Glu Ala Gly His Ile Glu Pro Tyr Thr Gly Pro Gly Asn Asn 
35 40 45 

Pro Val Phe Pro Val Lys Lys Ala Asn Gly Thr Trp Arg Phe Ile His 
50 55 60 

Asp Leu Arg Ala Thr Asn Ser Leu Thr Ile Asp Leu Ser Ser Ser Ser 
65 70 75 80 

Pro Gly Pro Pro Asp Leu Ser Ser Leu Pro Thr Thr Leu Ala His Leu 
85 90 95 

Gln Thr Ile Asp Leu Arg Asp Ala Phe Phe Gln Ile Pro Leu Pro Lys 
100 105 110 

Gln Phe Gln Pro Tyr Phe Ala Phe Thr Val Pro Gln Gln Cys Asn Tyr 
115 120 125 

Gly Pro Gly Thr Arg Tyr Ala Trp Lys Val Leu Pro Gln Gly Phe Lys 
130 135 140 

Asn Ser Pro Thr Leu Phe Glu Met Gln Leu Ala His Ile Leu Gln Pro 
145 150 155 160 

Ile Arg Gln Ala Phe Pro Gln Cys Thr Ile Leu Gln Tyr Met Asp Asp 
165 170 175 

Ile Leu Leu Ala Ser Pro Ser His Glu Asp Leu Leu Leu Leu Ser Glu 
180 185 190 

Ala Thr Met Ala Ser Leu Ile Ser His Gly Leu Pro Val Ser Glu Asn 
195 200 205 

Lys Thr Gln Gln Thr Pro Gly Thr Ile Lys Phe Leu Gly Gln Ile Ile 
210 215 220 

Ser Pro Asn His Leu Thr Tyr Asp Ala Val Pro Thr Val Pro Ile Arg 
225 230 235 240 

Ser Arg Trp Ala Leu Pro Glu Leu Gln Ala Leu Leu Gly Glu Ile Gln 
245 250 255 

Trp Val Ser Lys Gly Thr Pro Thr Leu Arg Gln Pro Leu His Ser Leu 
260 265 270 

Tyr Cys Ala Leu Gln Arg His Thr Asp Pro Arg Asp Gln Ile Tyr Leu 
275 280 285 

Asn Pro Ser Gln Val Gln Ser Leu Val Gln Leu Arg Gln Ala Leu ^r 
290 295 300 

Gln Asn Cys Arg Ser Arg Leu Val Gln Thr Leu Pro Leu Leu GI^ Ala 
305 310 315 320 

Ile Met Leu Thr Leu Thr Gly Thr Thr Thr Val Val Phe Gln Ser Lys 
325 330 335 

Glu Gln Trp Pro Leu Val Trp Leu His Ala Pro Leu Pro His Thr Ser 
340 345 350 

Gln Cys Pro Trp Gly Gln Leu Leu Ala Ser Ala Val L^i,i-- Leu Leu Asp 
355 360 365 

Lys Tyr Thr Leu Gln Ser Tyr Gly Leu Leu Cys Thr Ile His His 
370 375 

Asn Ile Ser Thr Gln Thr Phe Asn Gln Phe i;i^ Gln Thr Ser Asp His 
385 390 395 400 

Pro Ser Val Pro Ile Leu Leu His His Ser His Arg Phe Lys Asn Leu 
405 410 415 



Gly Ala Gin Thr Gly Glu Leu Trp Asn 
420 425 1 



le Leu Lys Thr Ala Ala 
430 



Pro Leu Ala Pro Val Lys Ala Leu Met Pro Val Phe Thr Leu Ser Pro 
435 440 445 

Val Ile Ile Asn Thr Ala Pro Cys Leu Phe Ser Asp Gly Ser Thr Ser 
450 455 460 

Arg Ala Ala Tyr Ile Leu Trp hs4 Lys Gln Ile Leu Ser Gln Arg Ser 
465 470 475 480 

Phe Pro Leu Pro Pro Pro His Lys Ser Ala Gln Arg Ala Glu Leu Leu 
485 490 495 

Gly Leu Leu His Gly Leu Ser Ser Ala Arg Ser Trp Arg Cys Leu Asn 
500 505 510 

Ile Phe Leu Asp Ser Lys Tyr Leu Tyr His Tyr Leu Arg Thr Leu Ala 
515 520 525 

Leu Gly Thr Phe Gln Gly Arg Ser Ser Gln Ala Pro Phe Gln Ala Leu 
530 535 540 

Leu Pro Arg Leu Leu Ser Arg Lys Val Val Tyr Leu His His Val Arg 
545 550 555 560 

Ser His Thr Asn Leu Pro Asp Pro Ile Ser Arg Leu Asn Ala Leu Thr 
565 570 575 

Asp Ala 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 555 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Asn Val Leu Tyr Arg Ile Gly Ser Asp Asn Gln Tyr Thr Gln Phe Thr 
1 5 10 15 

Ile Pro Lys Lys Gly Lys Gly Val Arg Thr Ile Ser Ala Pro Thr Asp 
20 25 30 

Arg Leu Lys Asp Ile Gln Arg Arg Ile Cys Asp Leu Leu Ser Asp Cys 
35 40 45 

Arg Asp Glu Ile Phe Ala Ile Arg Lys Ile Ser Asn Asn Tyr Ser Phe 
50 55 60 

Gly Phe Glu Arg Gly Lys Ser Ile Ile Leu Asn Ala Tyr Lys His Arg 
65 70 75 80 

Gly Lys Gln Ile Ile Leu Asn Ile Asp Leu Lys Asp Phe Phe Glu Ser 
85 90 95 

Phe Asn Phe Gly Arg Val Arg Gly Tyr Phe Leu Ser Asn Gln Asp Phe 
100 105 110 

Leu Leu Asn Pro Val Val Ala Thr Thr Leu Ala Lys Ala Ala Cys Tyr 
115 120 125 

Asn Gly Thr Leu Pro Gln Gly Ser Pro Cys Ser Pro Ile Ile Ser Asn 
130 135 140 

Leu Ile Cys Asn Ile Met Asp Met Arg Leu Ala Lys Leu Ala Lys Lys 
145 150 155 160 

Tyr Gly Cys Thr Tyr Ser Arg Tyr Ala Asp Asp Ile Thr Ile Ser Thr 
165 170 175 

Asn Lys Asn Thr Phe Pro Leu Glu Met Ala Thr Val Gln Pro Glu Gly 
180 185 190 

Val Val Leu Gly Lys Val Leu Val Lys Glu Ile Glu Asn Ser Gly Phe 
195 200 205 

Glu Ile Asn Asp Ser Lys Thr Arg Leu Thr Tyr Lys Thr Ser Arg Gln 
210 215 220 

Glu Val Thr Gly Leu Thr Val Asn Arg Ile Val Asn Ile Asp Arg Cys 
225 230 235 240 

Tyr Tyr Lys Lys Thr Arg Ala Leu Ala His Ala Leu Tyr Arg Thr Gly 
245 250 255 

Glu Tyr Lys Val Pro Asp Glu Asn Gly Val Leu Val Ser Gly Gly Leu 
260 265 270 

Asp Lys Leu Glu Gly Met Phe Gly Phe Ile Asp Gln Val Asp Lys Phe 
275 280 285 

Asn Asn Ile Lys Lys Lys Leu Asn Lys Gln Pro Asp Arg Tyr Val Leu 
290 295 300 

Thr Asn Ala Thr Leu His Gly Phe Lys Leu Lys Leu Asn Ala Arg Glu 
305 310 315 320 

Lys Ala Tyr Ser Lys Phe Ile Tyr Tyr Lys Phe Phe His Gl^ Asn Thr 
325 330 335 

Cys Pro Thr Ile Ile Thr Glu Gly Lys Thr Asp Arg Ile Tyr Leu Lys 
340 345 350 

Ala Ala Leu His Ser Leu Glu Thr Ser Tyr Pro Glu Leu Phe Arg Glu 
355 360 365 

Lys Thr Asp Ser Lys Lys Lys Glu Ile Asn Leu Asn Ile Phe Lys Ser 
370 375 380 

Asn Glu Lys Thr Lys Tyr Phe Leu Asp Leu Ser Gly Gly Thr Ala Asp 
385 390 395 400 

Leu Lys Lys Phe Val Glu Arg Tyr Lys Asi/ ^^sn Tyr/'Ala Ser Tyr Tyr 
405 4^0/ / 415 



Gly Ser Val Pro Lys Gin Pro Val Ile^ 
420 42E 



Leu Asp Asn Asp Thr 
430 



Gly Pro Ser Asp Leu Leu Asn Phe Leu Arg Asn Lys Val Lys Ser Cys 
435 440 445 

Pro Asp Asp Val Thr Glu Met Arg Lys Met Lys Tyr Ile His Val Phe 
450 455 460 

Tyr Asn Leu Tyr Ile Val Leu Thr Pro Leu Ser Pro Ser Gly Glu Gln 
465 470 475 480 

Thr Ser Met Glu Asp Leu Phe Pro Lys Asp Ile Leu Asp Ile Lys Ile 
485 490 495 

Asp Gly Lys Lys Phe Asn Lys Asn Asn Asp Gly Asp Ser Lys Thr Glu 
500 505 510 

Tyr Gly Lys His Ile Phe Ser Met Arg Val Val Arg Asp Lys Lys Arg 
515 520 525 

Lys Ile Asp Phe Lys Ala Phe Cys Cys Ile Phe Asp Ala Ile Lys Asp 
530 535 540 

Ile Lys Glu His Tyr Lys Leu Met Leu Asn Ser 
545 550 555 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Arg Trp Phe Ser Phe His Arg Glu Val Asp Thr Gly Thr His Tyr Gln 
1 5 10 15 

Thr Trp Glu Ile Pro Lys Arg Asp Gly Gly Lys Arg Thr Leu Thr Ala 
20 25 30 

Pro Lys Arg Glu Leu Lys Ala Val Gln Arg Trp Val Leu Ala Asn Val 
35 40 45 

Val Glu Arg Leu Pro Val His Gly Ala Ala His Gly Phe Val Ala Gly 
50 55 60 

Arg Ser Ile Leu Thr Asn Ala Leu Ala His Gln Gly Ala Asp Val Val 
65 70 75 80 

Val Lys Val Asp Met Lys Asp Phe Phe Pro Ser Val Thr Trp Pro Arg 
85 90 95 

Val Lys Gly Leu Leu Arg Lys Gly Gly Leu Pro Glu Asn Leu Ala Thr 
100 105 110 

Leu Leu Ala Leu Leu Ser Thr Glu Ala Pro Arg Glu Val Val Arg Phe 
115 120 125 

Arg Gly Glu Thr Leu Tyr Val Ala Lys Gly Pro Arg Ala Leu Pro Gln 
130 135 140 

Gly Ala Pro Thr Ser Pro Ala Leu Thr Asn Ala Leu Cys Leu Arg Leu 
145 150 155 160 

Asp Lys Arg Leu Ser Ala Leu Ser Lys Arg Leu Gly Phe Thr Tyr Thr 
165 170 175 

Arg Tyr Ala Asp Asp Leu Thr Phe Ser Trp Arg Arg Ala Lys Lys Ser 
180 185 190 

Arg Gln Lys Glu Leu Pro Leu Ala Asp Ala Pro Val Ala Leu Leu Leu 
195 200 205 

Ala Arg Val Lys Gly Val Leu Glu Ala Glu Gly Phe Thr Leu His Pro 
210 215 220 

Asp Lys Thr Arg Val Gln Arg Lys Gly Ser Arg Gln Arg Val Thr Gly 
225 230 235 240 

Leu Val Val 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Arg Trp Phe Ala Phe His Arg Glu Val Asp Thr Ala Thr His Tyr Val 
1 5 10 15 

Ser Trp Thr Ile Pro Lys Arg Asp Gly Ser Lys Arg Thr Ile Thr Ser 
20 25 30 

Pro Lys Pro Glu Leu Lys Ala Ala Gln Arg Trp Val Leu Ser Asn Val 
35 40 45 



Val Glu Arg Leu Pro Val His Gly Ala Ala His Gly Phe Val Ala Gly 
50 55 60 



Arg Ser Ile Leu Thr Asn Ala Leu Ala His Gln Gly Ala Asp Val Val 
65 70 75 80 

Val Lys Val Asp Leu Lys Asp Phe Phe Pro Ser Val Thr Trp Arg Arg 
85 90 95 

Val Lys Gly Leu Leu Arg Lys Gly Gly Leu 
100 105 
Gly Thr Ser Thr 
110 

Leu Leu Ser Leu Leu Ser Thr Glu Ala Pro 
115 120 
Ala Val Gln Phe 
125 

Pro Arg Glu Leu Leu His Val Ala Lys 
130 135 
Ala Leu Pro Gln 

Gly Ala Pro Thr Ser Pro Gly Ile Thr Asn 
145 150 155 
Cys Leu Lys Leu 
160 

Asp Lys Arg Leu Ser Ala Leu Ala Lys Arg Leu Gly Phe Thr Tyr Thr 
165 170 175 

Arg Tyr Ala Asp Asp Leu Thr Phe Ser Trp Thr Lys Ala Lys Gln Pro 
180 185 190 

Lys Pro Arg Arg Thr Gln Arg Pro Pro Val Ala Val Leu Leu Ser Arg 
195 200 205 



Val Gln Glu Val Val Glu Ala Glu Gly Phe Arg Val His Pro Asp Lys 
210 215 220 

Thr Arg Val Ala Arg Lys Gly Thr Arg Gln Arg Val Thr Gly Leu Val 
225 230 235 240 

Val 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Arg His Tyr Ser Ile His Arg Pro Arg Glu Arg Val Arg His Tyr Val 
1 5 10 15 

Thr Phe Ala Val Pro Lys Arg Ser Gly Gly Val Arg Leu Leu His Ala 
20 25 30 

Pro Lys Arg Arg Leu Lys Ala Leu Gln Arg Arg Met Leu Ala Leu Leu 
35 40 45 

Val Ser Lys Leu Pro Val Ser Pro Gln Ala His Gly Phe Val Pro Gly 
50 55 60 

Arg Ser Ile Lys Thr Gly Ala Ala Pro His Val Gly Arg Arg Val Val 
65 70 75 80 

Leu Lys Leu Asp Leu Lys Asp Phe Phe Pro Ser Val Thr Phe Ala Arg 
85 90 95 

Val Arg Gly Leu Leu Lys Ala Leu Gly Tyr Gly Tyr Pro Val Ala Ala 
100 105 110 

Thr Leu Ala Val Leu Met Thr Glu Ser Glu Arg Gln Pro Val Glu Leu 
115 120 125 

Glu Gly Ile Leu Phe His Val Pro Val Gly Pro Arg Val Cys Val Gln 
130 135 140 

Gly Ala Pro Thr Ser Pro Ala Leu Cys Asn Ala Val Leu Leu Arg Leu 
145 150 155 160 

Asp Arg Arg Leu Ala Gly Leu Ala Arg Arg Tyr Gly Tyr Thr Tyr Thr 
165 170 175 

Arg Tyr Ala Asp Asp Leu Thr Phe Ser Gly Asp Asp Val Thr Ala Leu 
180 185 190 

Glu Arg Val Arg Ala Leu Ala Ala Arg Tyr Val Gln Glu Glu Gly Phe 
195 200 205 

Glu Val Asn Arg Glu Lys Thr Arg Val Gln Arg Arg Gly Gly Ala Gln 
210 215 220 

Arg Val Thr Gly Val Thr Val 
225 230 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Phe Leu Thr Asn Val Leu Tyr Arg Ile Gly Ser Asp Asn Gln Tyr Thr 
1 5 10 15 

Gln Phe Thr Ile Pro Lys Lys Gly Lys Gly Val Arg Thr Ile Ser Ala 
20 25 30 

Pro Thr Asp Arg Leu Lys Asp Ile Gln Arg Arg Ile Cys Asp Leu Leu 
35 40 45 

Ser Asp Cys Arg Asp Glu Ile Phe Ala Ile Arg Lys Ile Ser Asn Asn 
50 55 60 

Tyr Ser Phe Gly Phe Glu Arg Gly Lys Ser Ile Ile Leu Asn Ala Tyr 
65 70 75 80 

Lys His Arg Gly Lys Gln Ile Ile Leu Asn Ile Asp Leu Lys Asp Phe 
85 90 95 

Phe Glu Ser Phe Asn Phe Gly Arg Val Arg Gly Tyr Phe Leu Ser Asn 
100 105 110 

Gln Asp Phe Leu Leu Asn Pro Val Val Ala Thr Thr Leu Ala Lys Ala 
115 120 125 

Ala Cys Tyr Asn Gly Thr Leu Pro Gln Gly Ser Pro Cys Ser Pro Ile 
130 135 140 

Ile Ser Asn Leu Ile Cys Asn Ile Met Asp Met Arg Leu Ala Lys Leu 
145 150 155 160 

Ala Lys Lys Tyr Gly Cys Thr Tyr Ser Arg Tyr Ala Asp Asp Ile Thr 
165 170 175 

Ile Ser Thr Asn Lys Asn Thr Phe Pro Leu Glu Met Ala Thr Val Gln 
180 185 190 

Pro Glu Gly Val Val Leu Gly Lys Val Leu Val Lys Glu Ile Glu Asn 
195 200 205 

Ser Gly Phe Glu Ile Asn Asp Ser Lys Thr Arg Leu Thr Tyr Lys Thr 
210 215 220 

Ser Arg Gln Glu Val Thr Gly Leu Thr Val 
225 230 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Val Glu Thr Leu Arg Leu Leu Ile Tyr Thr Ala Asp Phe Arg Tyr Arg 
1 5 10 15 

Ile Tyr Thr Val Glu Lys Lys Gly Pro Glu Lys Arg Met Arg Thr Ile 
20 25 30 

Tyr Gln Pro Ser Arg Glu Leu Lys Ala Leu Gln Gly Trp Val Leu Arg 
35 40 45 

Asn Ile Leu Asp Lys Leu Ser Ser Ser Pro Phe Ser Ile Gly Phe Glu 
50 55 60 

Lys His Gln Ser Ile Leu Asn Asn Ala Thr Pro His Ile Gly Ala Asn 
65 70 75 80 

Phe Ile Leu Asn Ile Asp Leu Glu Asp Phe Phe Pro Ser Leu "^r Ala 
85 90 95 

Asn Lys Val Phe Gly Val Phe His Ser Leu Gly Tyr Asn Arg Leu Ile 
100 105 110 

Ser Ser Val Leu Thr Lys Ile Cys Cys Tyr Lys Asn Leu Leu Pro Gln 
115 120 125 

Gly Ala Pro Ser Ser Pro Lys Leu Ala Asn Leu Ile Cys Ser Lys Leu 
130 135 140 

Asp Tyr Arg Ile Gln Gly Tyr Ala Gly Ser Arg Gly Leu Ile Tyr Thr 
145 150 155 160 

Arg Tyr Ala Asp Asp Leu Thr Leu Ser Ala Ser Met Lys Lys Val 
165 170 175 

Val Lys Ala Arg Asp Phe Leu Phe Ser Ile Ile Pro Ser Glu Gly Leu 
180 185 190 

Val Ile Asn Ser Lys Lys Thr Cys Ile Ser Gly Pro Arg Ser Gln Arg 
195 200 205 

Lys Val Thr Gly Leu Val Ile 
210 215 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 230 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



Thr Lys Gly Phe Ala Ser Glu Val Met Arg Ser Pro Glu Pro Pro Lys 
1 5 10 15 

Lys Trp Asp Ile Ala Lys Lys Lys Gly Gly Met Arg Thr Ile Tyr His 
20 25 30 

Pro Ser Ser Lys Val Lys Leu Ile Gln Tyr Trp Leu Met Asn Asn Val 
35 40 45 

Phe Ser Lys Leu Pro Met His Asn Ala Ala Tyr Ala Phe Val Lys Asn 
50 55 60 

Arg Ser Ile Lys Ser Asn Ala Leu Leu His Ala Glu Ser Lys Asn Lys 
65 70 75 80 

Tyr Tyr Val Lys Ile Asp Leu Lys Asp Phe Phe Pro Ser Ile Lys Phe 
85 90 95 




Thr Asp Phe Glu Tyr Ala Phe Thr Arg Tyr Arg Asp Arg Ile Glu Phe 
100 105 110 

Thr Thr Glu Tyr Asp Leu Glu Leu Leu Gln Leu Ile Lys Thr Ile Cys 
115 120 125 

Phe Ile Ser Asp Ser Thr Leu Pro Ile Gly Phe Pro Thr Ser Pro Leu 
130 135 140 

Ile Ala Asn Phe Val Ala Arg Glu Leu Asp Glu Lys Leu Thr Gln Lys 
145 150 155 160 

Leu Asn Ala Ile Asp Lys Leu Asn Ala Thr Tyr Thr Arg Tyr Ala Asp 
165 170 175 

Asp Ile Ile Val Ser Thr Asn Met Lys Gly Ala Ser Lys Leu Ile Leu 
180 185 190 

Asp Cys Phe Lys Arg Thr Met Lys Glu Ile Gly Pro Asp Phe Lys Ile 
195 200 205 

Asn Ile Lys Lys Phe Lys Ile Cys Ser Ala Ser Gly Gly Ser Ile Val 
210 215 220 

Val Thr Gly Leu Lys Val 
225 230 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



Ile Gln Arg Leu His Ala Leu Ser Asn His Ala Gly Arg His Tyr Arg 
1 5 10 15 

Arg Ile Ile Leu Ser Lys Arg His Gly Gly Gln Arg Leu Val Leu Ala 
20 25 30 

Pro Asp Tyr Leu Leu Lys Thr Val Gln Arg Asn Ile Leu Lys Asn Val 
35 40 45 

Leu Ser Gln Phe Pro Leu Ser Pro Phe Ala Thr Ala Tyr Arg Pro Gly 
50 55 60 

Cys Pro Ile Val Ser Asn Ala Gln Pro His Cys Gln Gln Pro Gln Ile 
65 70 75 80 

Leu Lys Leu Asp Ile Glu Asn Phe Phe Asp Ser Ile Ser Trp Leu Gln 
85 90 95 

Val Trp Arg Val Phe Arg Gln Ala Gln Leu Pro Arg Asn Val Val Thr 
100 105 110 

Met Leu Thr Trp Ile Cys Cys Tyr Asn Asp Ala Leu Pro Gln Gly Ala 
115 120 125 

Pro Thr Ser Pro Ala Ile Ser Asn Leu Val Met Arg Arg Phe Asp Glu 
130 135 140 

Arg Ile Gly Glu Trp Cys Gln Ala Arg Gly Ile Thr Tyr Thr Arg Tyr 
145 150 155 160 

Cys Asp Asp Met Thr Phe Ser Gly His Phe Asn Ala Arg Gln Val Lys 
165 170 175 

Asn Lys Val Cys Gly Leu Leu Ala Glu Leu Gly Leu Ser Leu Asn Lys 
180 185 190 

Arg Lys Gly Cys Leu Ile Ala Ala Cys Lys Arg Gln Gln Val Thr Gly 
195 200 205 

Ile Val Val 
210 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1640 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 279.. 1559 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

CTCCGAGCCC GCCTCCGAGG ACGCGCTCGC GGCCCGGGCG GCGGGGGCGG ACGCGCGGCG 

GCGGCCCACG GAGACGCTTG ACCCGGGAGA (2GACGAATGA CGATAACGGC AGGTGCTCTC 

GGGAGAGGCC AGGGCTCGCA GATGAGCCAy GAGTACCGCG GTGTTTCGCC GCGGGGGTGT 

TCTGTCCCCA TCTCTTCGCC AGGGTCCCAg CGTACGCAAC GCAGGGAGCC CCGGGTCCAA 

CGCCTCGCAG GTCGTCCCCT GGCCTCTJCC GGAGCACC ATG AGC TGG TTC GAC 

Met Ser Trp Phe Asp 
1 5 

ACC ACC CTC TCC CGG CTC AAG GGG TTG TTC AGC CGT CCC GTG ACA CGA 
Thr Thr Leu Ser Arg Leu Lys Gly Leu Phe Ser Arg Pro Val Thr Arg 
10 15 20 



AGC ACC ACC GGG CTG GAC 
Ser Thr Thr Gly Leu Asp 
25 



CCG CTG GAT GCC CAC GGA CGT CCC CAG 
Pro Leu Asp Ala His Gly Arg Pro Gin 
30 35 



GAC GTC GTG ACG GAG ACG GTC TCC ACG TCG GGC CCC CTG AAG CCA GGG 
Asp Val Val Thr Glu Thr Val Ser Thr Ser Gly Pro Leu Lys Pro Gly 
40 45 50 

CAC CTG CGA CAG GTC CC^ CGG GAT GCG CGG CTG CTC CCC AAG GGC GTC 
His Leu Arg Gin Val Arg Arg Asp Ala Arg Leu Leu Pro Lys Gly Val 
55 / 60 65 



60 
120 
180 
240 
293 

341 

389 

437 

485 




• 



CGC CGC TAC ACC CCG GGC CGG AAG AAG TGG ATG GAG GCC GCC GAG GCC 
Arg Arg Tyr Thr Pro Gly Arg Lys Lys Trp Met Glu Ala Ala Glu Ala 
70 75 80 85 

533 



CGG CGG CTG TTC TCC GCC ACG CTG CGC ACG CGG AAC CGG AAC CTG AGG 
Arg Arg Leu Phe Ser Ala Thr Leu Arg Thr Arg Asn Arg Asn Leu Arg 
90 95 100 



581 



GAC TTG CTG CCC GAC GAG GCA CAG CTG GCG CGC TAC GGC CTG CCG GTC 
Asp Leu Leu Pro Asp Glu Ala Gln Leu Ala Arg Tyr Gly Leu Pro Val 
105 110 115 



629 



TGG CGC ACG GAA GAG GAC GTG GCA GCG GCC CTG GGC GTC TCG GTG GGC 
Trp Arg Thr Glu Glu Asp Val Ala Ala Ala Leu Gly Val Ser Val Gly 
120 125 130 



677 



GTG CTC CGC CAC TAC AGC ATC CAC CGC CCG CGC GAG CGG GTG CGG CAC 
Val Leu Arg His Tyr Ser Ile His Arg Pro Arg Glu Arg Val Arg His 
135 140 145 

TAC GTG ACC TTC GCC GTG CCC AAG CGC TCC GGA GGC GTC 9GG CTG CTG 
Tyr Val Thr Phe Ala Val Pro Lys Arg Ser Gly Gly Val Arg Leu Leu 
150 155 160 165 

CAT GCG CCC AAG CGG CGC CTG AAG GCC CTG CAA CGC CGG ATG CTG GCG 
His Ala Pro Lys Arg Arg Leu Lys Ala Leu Gln Arg Arg Met Leu Ala 
170 175 180 



725 



773 



821 



CTC CTG GTG TCG AAG CTC CCC GTG AGT CCA CAG GCC CAT GGC TTC GTG 
Leu Leu Val Ser Lys Leu Pro Val Ser Pro Gln Ala His Gly Phe Val 
185 190 195 



869 



CCC GGC CGC TCC ATC AAG ACG GGC GCC GC<^ CCG CAC GTG GGC CGG CGG 
Pro Gly Arg Ser lie Lys Thr Gly Ala A^a Pro His Val Gly Arg Arg 
200 205 / 210 



917 



GTG GTC CTG AAG CTG GAC CTG AAG 
Val Val Leu Lys Leu Asp Leu Lys As 
215 220 



TTC TTC CCC TCC GTC ACC TTC 
Phe Phe Pro Ser Val Thr Phe 
225 



965 



GCG CGG GTG CGA GGG CTG CTC ATC GCC CTG GGC TAC GGC TAT CCC GTG 
Ala Arg Val Arg Gly Leu Leu Ile Ala Leu Gly Tyr Gly Tyr Pro Val 
230 235 240 245 

GCG GCC ACG CTC GCG GTG CTG ATG ACG GAG TCC GAG CGC CAG CCC GTG 
Ala Ala Thr Leu Ala Val Leu Met Thr Glu Ser Glu Arg Gln Pro Val 
250 255 260 



1013 



1061 



GAG CTG GAG GGC ATC CTC TTC CAC GTT CCC GTG GGC CCA CGC GTC TGC 
Glu Leu Glu Gly lie Leu Phe His Val Pro Val Gly Pro Arg Val Cys 
265 / 270 275 



1109 



GTG CAG GGC GCC CCC ACG/ AGC CCC GCC CTG TGC AAC GCG GTG CTG CTG 
Val Gin Gly Ala Pro Thr Ser Pro Ala Leu Cys Asn Ala Val Leu Leu 
280 / 285 290 



CTC 



CGA CTG GAC CGG CGG CTG GCG GGA CTG GCG CGT CGG TAC GGC TAC ACG 
Arg Leu Asp Arg Arg Leu Ala Gly Leu Ala Arg Arg Tyr Gly Tyr Thr 
295 300 305 

TAC ACG CGC TAC GCG GAT GAC CTC ACC TTC TCC GGC GAC GAC GTC ACG 
Tyr Thr Arg Tyr Ala Asp Asp Leu Thr Phe Ser Gly Asp Asp Val Thr 
310 315 320 325 



1157 



1205 



1253 



GCG CTG GAG CGA GTC CGC GCG CTG GCC GCG CGG TAC GTG CAG GAG GAA 
Ala Leu Glu Arg Val Arg Ala Leu Ala Ala Arg Tyr Val Gln Glu Glu 
330 335 340 

GGC TTC GAG GTC AAC CGC GAG AAG ACC CGC GTG CAG CGC CGG GGC GGT 
Gly Phe Glu Val Asn Arg Glu Lys Thr Arg Val Gln Arg Arg Gly Gly 
345 350 355 

GCC CAG CGC GTC ACT GGC GTC ACC GTG AAT ACG ACG CTG GGC TTG ^CA 
Ala Gin Arg Val Thr Gly Val Thr Val Asn Thr Thr Leu Gly Leu> 
360 365 370 

CGC GAG GAG CGG CCG CGG CTC CGG GCG ATG CTG CAC CAG GAG Caffcc CGG 
Arg Glu Glu Arg Pro Arg Leu Arg Ala Met Leu His Gin Glu Ala Arg 
375 380 385 



TCG GAG GAC GTC GAG GCA CAC CGC GCG CAC CTC GAC GGC 
Ser Glu Asp Val Glu Ala His Arg Ala His Leu Asp Gly 
390 395 400 

TAC GTG AAG ATG CTC AAC CCG GAG CAG GCG GAG CGG 
Tyr Val Lys Met Leu Asn Pro Glu Gin Ala Glu Arg 
410 415 

CGC AAG CCG CGC GGG ACG TGAGCGAGGG CTCAGCTCCG> 
Arg Lys Pro Arg Gly Thr 
425 



C CTG GCC 
u Leu Ala 
405 

CT CGC CGG 



^Ala Arg Arg 
420 



GGCCTGTCAC GCGTCCCGGC CTCCCAGTTG TCATGGCGg 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 763. • 2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



:gtcccagta c 



1349 



1397 



1445 



1493 



1541 




1640 



CCCACTTCCG GCGCTCGGGC TGCGoGAGGG CCCGTGCGAG CACATGATGG CGCTGCGGCT 60 
CGTCCAGGTC CGGCACCGCG CCGAGCAGGA AGCACTGCGT CAGACCCCCG CGGGCCGCCA 120 
GCTCATCCGC GCGGAGACGC GCTCCTACGT GCGGCGCGAG CCCTCCGGCC AGGAGCAGGT 180 
GTACCGCGTC TCATTGGATG GGAAAGTGGT GGCGGTGGAG TGGGGCCCCC GCCAGGGGGA 240 
GTCCCGCCGG CAGAAGCTCT ^TTCGACAC GGACGCCGAG GCGCGCACCG CCTACTTCAC 300 
GCGCCTGGAG TCCTTGGCCG CGGAGGGATA TATCGATGCG GCTGCTTCAA TGATGTAGAA 360 
CACGCAAGCC ACGGGGCCoC GGGCGCGCGG CGGAAAGGCA GGTGCGACGG AACGACAGAC 420 
ACTCGTGCGA GCGACCGAX3A GAGGTCCCAA GCCATCAGCC TCAGCGCCTC GAGCGCGAGA 480 
GCGGCGTTGC GCCGCToTGG TTGAATTGCA GGACACTCTC CGCAAGGTAG CCTGTTCTTG 540 



GCTCTCTTCC CTCCGGTGAG TACCTCTCCG GCCGGGGAGC TGAACCAACG ACGCAACCGC 

CGTTTCCCCG GCCGGAGAGG TACTCACCGG AGGGGAGAGC CGGTGAGGCT ACCGTGCCCC 

AGGTGAGAAG GTGGTGCCTT CGGGCCTCCC TCGACCGCTC GCGCTCCGTC GCCCTGCCCT 

GCCTCGCCCC CCCCACCTTG CTCACCGGCG CCAGGAGCCG TC ATG ACC GCC AAG 

Met Thr Ala Lys 
1 



CTG GAG TCA CAC GTC CCC GCC GCG CCC CCC GTC TCC GCC GAG GCG CCC 
Leu Glu Ser His Val Pro Ala Ala Pro Pro Val Ser Ala Glu Ala Pro 
5 10 15 20 



822 



GCC CCC ACC CGT CCC GAT GCC GCG AAG CAG GAG GCC CGC CGC GCC CAC 
Ala Pro Thr Arg Pro Asp Ala Ala Lys Gln Glu Ala Arg Arg Ala His 
25 30 35 

CAC GAG GCG CTG CGC CTG CGG TGG AAG GCC ATC GAA GAG GCG GGC GGC 
His Glu Ala Leu Arg Leu Arg Trp Lys Ala Ile Glu Glu Ala Gly Gly 
40 45 50 



870 



918 



ACG GAC GCC TGG GTG CGG CAG CAG CTG GTG GCC AAG/GGCT GTC GCG GCG 
Thr Asp Ala Trp Val Arg Gin Gin Leu Val Ala Ly/ Gly Val ^la Ala 
55 60 7 /65 



966 



GAA GAG GTG GAC TTC GAG TCG CTC AGC GAC AAG CAG AAG GCG GCC TGG 
Glu Glu Val Asp Phe Glu Ser Leu Ser Asp Lys Gln Lys Ala Ala Trp 
70 75 80 



1014 



AAG GAG AAG AAG AAG GCC GAG GCC ACC GAG CGG CGC GCG CAG AAG CGC 
Lys Glu Lys Lys Lys Ala Glu Ala Thr Glu Arg Arg Ala Gln Lys Arg 
85 90 95 100 



1062 



CTG GCG TGG GAG GCC TGG AAG GCC ACG CAC ATC CAC CAC CTG GGC GTG 
Leu Ala Trp Glu Ala Trp Lys Ala Thr His Ile His His Leu Gly Val 
105 110 115 



1110 



GGG GTG CAC TGG GAC GAG GCC GGA GGG CCG GAC AAG TTC GAC GTG GCC 
Gly Val His Trp Asp Glu Ala Gly Gly Pro Asp Lys Phe Asp Val Ala 
120 125 130 



1158 



GGG CGC GAG GAG CGG GCC AAG GCC AAC GGC TTG CCG GAG GGG TTG GAC 
Gly Arg Glu Glu Arg Ala Lys Ala Asn Gly Leu Pro Glu Gly Leu Asp 
135 /l40 145 



1206 



TCG GTC GAG GCG CTG GCC 
Ser Val Glu Ala Leu Ala 
150 



GCG CTG GGC ATC TCC GTG TCG CGC CTG 
^s Ala Leu Gly lie Ser Val Ser Arg Leu 
)5 160 



1254 



CGC TGG TTC TCC TTC CAC CGC GAG GTG GAC ACG GGC ACG CAC TAC CAG 1302 
Arg Trp Phe Ser Phe His Arg Glu Val Asp Thr Gly Thr His Tyr Gln 
165 170 175 180 

ACG TGG GAG ATT CCG AAG CGG GAC GGC GGC AAG CGG ACG CTC ACC GCG 1350 
Thr Trp Glu Ile Pro Lys Arg Asp Gly Gly Lys Arg Thr Leu Thr Ala 
185 190 195 

CCG AAG CGG GAG CTC AAG GCC GTG CAG CGC TGG GTG CTC GCG AAC GTG 1398 
Pro Lys Arg Glu Leu Lys Ala Val Gln Arg Trp Val Leu Ala Asn Val 
200 205 210 




GTG GAG CGG CTG CCG GTG CAC GGG GCC GCG CAC GGC TTC GTG GCG GGG 
Val Glu Arg Leu Pro Val His Gly Ala Ala His Gly Phe Val Ala Gly 
215 220 225 

CGC TCC ATC CTC ACC AAC GCG CTG GCC CAC CAG GGC GCG GAC GTG GTG 
Arg Ser Ile Leu Thr Asn Ala Leu Ala His Gln Gly Ala Asp Val Val 
230 235 240 

GTG AAG GTG GAC ATG AAG GAC TTC TTC CCT TCC GTG ACG TGG CCC CGG 
Val Lys Val Asp Met Lys Asp Phe Phe Pro Ser Val Thr Trp Pro Arg 
245 250 255 260 



1446 



GTC AAG GGA CTG CTG CGC AAG GGA GGA CTC CCG GAG AAC CTG GCG 
Val Lys Gly Leu Leu Arg Lys Gly Gly Leu Pro Glu Asn Leu Ala 
265 270 275 

CTC CTG GCG CTG CTC TCC ACC GAG GCC CCG CGC GAG GTG 
Leu Leu Ala Leu Leu Ser Thr Glu Ala Pro Arg Glu Val 
280 285 

CGG GGA GAG ACG CTG TAC GTG GCC AAG GGC CCT CGC GCG 
Arg Gly Glu Thr Leu Tyr Val Ala Lys Gly Pro Arg Al 
295 300 30 

GGG GCC CCC ACC TCT CCG GCG CTG ACG AAC GCG CTG tGC CTG CGGXCTG 
Gly Ala Pro Thr Ser Pro Ala Leu Thr Asn Ala Leiy^Cys/Leu Arg Leu 
310 315 32, 




1590 



1734 



GAC AAG CGG CTC TCG GCG CTG TCG AAG CGG CTG JC^C 
Asp Lys Arg Leu Ser Ala Leu Ser Lys Arg Ley^ Gly 
325 330 3- 



iCG TAC ACG 
Thr Tyr Thr 
340 



1782 



CGC TAT GCG GAT GAC CTG ACG TTC TCC TGG yCGG CGG GCG AAG AAG TCC 

Arg Tyr Ala Asp Asp Leu Thr Phe Ser Trp^ Arg Arg Ala Lys Lys Ser 

345 350 355 

CGG CAG AAG GAA CTC CCC CTG GCG GAT yCCG CCG GTG GCG CTG CTC CTG 

Arg Gin Lys Glu Leu Pro Leu Ala Ast/ Ala Pro Val Ala Leu Leu Leu 
360 3^ 370 

GCG CGG GTG AAG GGT GTG CTG GAG X5CC GAG GGT TTC ACG CTG CAC CCG 

Ala Arg Val Lys Gly Val Leu Glu/Ala Glu Gly Phe Thr Leu His Pro 
375 38JE) 385 



GAC AAG ACG CGG GTG CAG CGC 
Asp Lys Thr Arg Val Gin Arg 
390 39 



CTC GTG GTG AAC GAG GCC 
Leu Val Val Asn Glu Ala 
405 410 




\G GGC AGC CGG CAG CGG GTG ACG GGG 
l^ys Gly Ser Arg Gin Arg Val Thr Gly 
400 



CC5C GAG GGC GTT CCG GGT GCC CGG GTG CCC 
Pro Glu Gly Val Pro Gly Ala Arg Val Pro 
/ 415 420 



1830 



1878 



1926 



1974 



2022 



CGC GAT GTG GTG CGG CGG 
Arg Asp Val Val Arg * — 
425 



CTG CGC GCG GCG ATC CAC AAC CGG GAG CAG 
Leu Arg Ala Ala lie His Asn Arg Glu Gin 
430 435 



GGC AAG CCC GGC CCC ACC GGG GAG ACG CTG GAG CAG CTC AAG GGG CTC 
Gly Lys Pro Gly Pro/Thr Gly Glu Thr Leu Glu Gin Leu Lys Gly Leu 
440 / 445 450 



2070 



2118 



GCG GCC TTC CTT CAC ATG ACG GAC GCG GAG AAG GGC CGC GCC TTC CTG 
Ala Ala Phe Leu hIb Met Thr Asp Ala Glu Lys Gly Arg Ala Phe Leu 
455 / 460 465 



2166 




CGA CGG CTG GAG GCC CTC GAG AAG CGC CAG ACC GCC TGACCCTCAC 2212 
Arg Arg Leu Glu Ala Leu Glu Lys Arg Gin Thr Ala 
470 475 480 



TfiflTPGTCCG GGGCATCGCA GCGGGCGCCG GGACGGACCG TCACCCCCCA GATCTCCATG 2272 
PCATGCTGGG GATTCTGGGC GGTGAAGAAG ACTTCCCAGC CGAGACGGAC GAAGCCCTGC 2332 
ACTCCTCGCC CGGGGCGATC TCCCGGAGGG GCACCGTTCC GACGTCCGTG 2392 
CCATTGCTCA CCCAGGGCTC CCGGCCCCAG CCTTGGGTGT CCGCCGAGAA GAAGAGCAGC 2452 
CCGGAGATGG CCGTCAGGTT CTCCGGCGAC GCATCCTCGG GGCCCGGCGC CAAATCCTTC 2512 
AGCAGCAGGG TGCCCTTGGC GGTGCCATCG CTGGACCACA GCTCCCGGCC GTGGAGGCTG 2572 
TCACTCGCGG CGAAGTAGAG CATCCCATTC AGCGCCTTGA TGGCGCTGGG CGCCGAGCTG 2632 
TCCGGACCCG GCCAGATGTC CTTCACCCGG ACCGTGCCAT GCGACGTGCC ATCGCTGACC 2692 
CACAGCTCCT CGCCCTCGGG CTGGCCCCAG AACTCGGGCT CGCCTCCCCC GGCGCTGAAG 2752 
AAGATCTTCC CCCCGAGCGC CGTGAGATCA TGCGGATAGA GGCCGGGGAA GAAGCGCAGC 2812 
TGCTCGGAGA CGGTGCCTCT GGAGCACCAC AGGCTGGCCT CGCCTTCGTC ATTGTCGAGC 2872 
AGGAAGAAGA GCACCGAGTC CGCCGCGGTG AACGCGGAGA GGAAGTTGTC CTCGGGGCCC 2932 
GTGAAGACAG ACGTGGTGCT GGACAGCCCC AGGCTGCGCC AGATGAACAC CTCGTCATTG 2992 
ACGTTGGCCA CGAAGAAGAG CGCATCGCCG ACCCGGGTGA GCCGGCGCGG GCTGGAGCTG 3052 
CCGGGCAC 3060 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2788 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2..103 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 707.. 1654 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1644.. 2591 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

T TTC GAG AAG CGC CAT ACC AAA CAG GGG ATA CAG ACC AAC CTG ACG 46 
Phe Glu Lys Arg His Thr Lys Gln Gly Ile Gln Thr Asn Leu Thr 
1 5 10 15 

CTG AAA GAG GAA AGC TAC GGC GAC TGG CTG CCG AAG TGC GAC GAC CCC 
Leu Lys Glu Glu Ser Tyr Gly Asp Trp Leu Pro Lys Cys Asp Asp Pro 
20 25 30 

GCA GCA ACA TAACCTGACT CAGACCGGCA ACAGCCGGTC TTTTGCTTTC 
Ala Ala Thr 




TGGCCATTGC CACAAGGTGA ACAATCCACT GTTCACCCTT CACCGTTTAT TCACC^TTTA 203 

TCACTATGAA ATTATTAATA AAAAACCAGA GGTGAACAGT GTGAACAGTA AAA^CTGAAA 263 

AAACTTTTTA TCACCCCGCG CATCGCCCGA CTGGACAGAT CCAGAACGAG CAAAAATCAC 323 

AAAGGTGACG AGTCGACTGT TCACTCTTCA CCAACTCATC ACCACCTAAGi^ACATGATAT 383 

AAAATGATAA ATAATCGAGG TGAACAGTTA AATGCAAAAA AACTTTTTCT CAGCTCTTGG 443 

ATAAAAGAAA ATTAATTCAC ATCAATAGCT TTCCTCTTGA ATCCTCTTGA GGTTTATGAG 503 

AGCGTAACAG AGCCAAACCT AGCATTTTAT GGGTTAATAG C^ldATCGCGC ATGAGTCATG 563 

GTTTCGCCTA GTATTTTAGC TATGCCCGTC GTTCAGTTCG ,CT^GCGG<5G GCTGGGGGCC 623 




ACCGATCAGC GAACTGATCG ACGTGCTCAA GTAGGTTTGG/CTC;PTTTAGT CCTCTACCAT 683 

CAAGGTGCAT AAGGATATTC TOG ATG CTG ACT C^G CK(^AAA AAA AAT GGT 733 

Met Leu Thr Gin Leu Lys Lys Asn Gly 

' / ' 

ACT GAG GTA TCT AGA GCA ACC GCG TTA TTT TCA TCA TTC GTT GAA AAG 781

Thr Glu Val Ser Arg Ala Thr Ala Leu Phe Ser Ser Phe Val Glu Lys
10 15 20 25

AAC AAA GTA AAA TGT CCT GGT AAT GTA AAA AAA TTC GTC TTT CTG TGT 829
Asn Lys Val Lys Cys Pro Gly Asn Val Lys Lys Phe Val Phe Leu Cys
30 35 40

GGT GCT AAC AAA AAC AAT GGA GAA CCA TCA GCA AGA CGA TTG GAA TTA 877
Gly Ala Asn Lys Asn Asn Gly Glu Pro Ser Ala Arg Arg Leu Glu Leu
45 50 55

ATA AAT TTT TCT GAA AGG TAT TTG AAT AAC TGT CAC TTT TTT CTT GCT 925 
Ile Asn Phe Ser Glu Arg Tyr Leu Asn Asn Cys His Phe Phe Leu Ala
60 65 70

GAA CTA GTT TTC AAA GAA TTA AGC ACC GAT GAA GAA TCA TTA TCT GAT 973 
Glu Leu Val Phe Lys Glu Leu Ser Thr Asp Glu Glu Ser Leu Ser Asp 
75 80 85

AAT TTA TTA GAT ATC GAA GCT GAC TTA TCT AAA TTA GCT GAT CAT ATT 1021 
Asn Leu Leu Asp Ile Glu Ala Asp Leu Ser Lys Leu Ala Asp His Ile
90 95 100 105

ATC ATT GTT TTA GAA AGT TAT TCA TCT TTC ACG GAA CTT GGT GCA TTC 1069
Ile Ile Val Leu Glu Ser Tyr Ser Ser Phe Thr Glu Leu Gly Ala Phe
110 115 120

GCA TAC AGC AAG CAA TTA CGC AAG AAA TTA ATA ATA GTT AAC AAT ACA 1117 
Ala Tyr Ser Lys Gln Leu Arg Lys Lys Leu Ile Ile Val Asn Asn Thr
125 130 135 



AAA TTT ATA AAT GAG AAA TCA TTT ATA AAT ATG GGA CCA ATA AAG GCT 1165
Lys Phe Ile Asn Glu Lys Ser Phe Ile Asn Met Gly Pro Ile Lys Ala
140 145 150

ATT ACT CAG CAA TCA CAA CAA TCT GGT CAT TTC TTA CAT TAT AAA ATG 1213
Ile Thr Gln Gln Ser Gln Gln Ser Gly His Phe Leu His Tyr Lys Met
155 160 165

ACA GAA GGT ATT GAA AGT ATA GAG CGC TCT GAT GGG ATT GGC GAA ATA 1261
Thr Glu Gly Ile Glu Ser Ile Glu Arg Ser Asp Gly Ile Gly Glu Ile
170 175 180 185

TTC GAC CCC CTA TAT GAT ATT CTT TCT AAG AAC GAC AGA GCA ATT TCA 1309
Phe Asp Pro Leu Tyr Asp Ile Leu Ser Lys Asn Asp Arg Ala Ile Ser
190 195 200

AGA ACT TTA AAA AAA GAA GAG TTA GAT CCT TCC AGT AAC TTC AAT AAA 1357
Arg Thr Leu Lys Lys Glu Glu Leu Asp Pro Ser Ser Asn Phe Asn Lys
205 210 215

GAC TCA GTA CGA TTT ATT CAT GAC GTA ATT TTT GTA TGT GGT CCT TTG 1405
Asp Ser Val Arg Phe Ile His Asp Val Ile Phe Val Cys Gly Pro Leu
220 225 230

CAA CTT AAT GAA CTC ATC GAA ATA ATC AGA AAA ATA TTT GGC ACA GAA 1453
Gln Leu Asn Glu Leu Ile Glu Ile Ile Arg Lys Ile Phe Gly Thr Glu
235 240 245

AGC CAT TAC AAA AAA AAT CTT CTA AAG CAC CTT GGT ATT CTA ATA GCT 1501
Ser His Tyr Lys Lys Asn Leu Leu Lys His Leu Gly Ile Leu Ile Ala
250 255 260 265



ATT AGA ATA ATA TCA TGC ACA AAT GGG ATT TAT TAT TCT TTG TAT AAA 1549
Ile Arg Ile Ile Ser Cys Thr Asn Gly Ile Tyr Tyr Ser Leu Tyr Lys
270 275 280

GAA TAT TAT TTT AAA TAT GAC TTT GAC ATT GAC AAC ATA TCA TCA ATG 1597
Glu Tyr Tyr Phe Lys Tyr Asp Phe Asp Ile Asp Asn Ile Ser Ser Met
285 290 295

TTT AAA GTT TTT TTC
Phe Lys Val Phe Phe
300

C AAG AAC AAG CCA GAA AGG ATG AGG GTA TAT 1645
Lys Asn Lys Pro Glu Arg Met Arg Val Tyr
305 310

GAG AAT ATA TAGCC
Glu Asn Ile
315

TT GATTCTCAGA CATTGATGAC TAAGGGATTT 1694



ACCTGAGCCG CCAAAAAAAT GGGATATAGC TAAGAAAAAA 1754

TCACCCGTCA TCAAAAGTTA AATTAATTCA ATATTGGTTA 1814

GCTCCCAATG CATAATGCTG CATATGCATT TGTTAAAAAC 1874

TTTATTACAT GCCGAATCAA AGAATAAGTA TTATGTGAAA 1934

CCCTTCAATA AAATTTACTG ATTTTGAGTA CGCATTCACT 1994

ATTTACTACA GAATATGATA AGGAGTTACT ACAACTTATA 2054

AGATAGCACT CTCCCTATCG GGTTTCCTAC ATCTCCATTA 2114

AGAACTTGAT GAAAAACTGA CGCAAAAACT AAATGCAATT 2174






GATAAACTTA ATGCCACTTA TACACGATAT GCTGATGATA TTATTGTCTC TACAAATATG 2234

AAAGGGGCTA GCAAATTAAT TCTGGATTGT TTTAAAAGAA CAATGAAAGA GATTGGTCCA 2294

GACTTTAAAA TTAACATTAA AAAATTTAAG ATTTGTAGTG CTTCGGGAGG AAGTATAGTA 2354

GTTACCGGAT TGAAAGTTTG CCACGATTTT CATATTACAT TACATAGATC AATGAAAGAT 2414

AAAATAAGAT TGCATCTTTC TCTTTTATCA AAGGGCATAT TAAAAGATGA AGATCATAA' 2474

AAACTTTCTG GTTATATTGC TTATGCAAAA GATATAGACC CTCATTTTTA TACTAAACTG 2534

AACAGAAAAT ATTTTCAAGA AATAAAATGG ATTCAGAATC TCCACAACAA AGTTGAATAA 2594

ACTTTATATT TTGGATGCAC CCCAATAACT TCATTGATTA AATTGGGAAC AATATAGGCT 2654

TTTCAGGATG ACCTACACTC TAGAGAATGT GTATACAAAA GTGTATAAGT TA'fTTTCAAA 2714

CCTATATAAA ATACAGCAAA ATCAATGCAT TGGCGGCATT TTACCACTCC TGTGATCTTC 2774

CGCCAAAATG CCTC 2788



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 316 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Met Arg Ile Tyr Ser Leu Ile Asp Ser Gln Thr Leu Met Thr Lys Gly
1 5 10 15

Phe Ala Ser Glu Val Met Arg Ser Pro Glu Pro Pro Lys Lys Trp Asp
20 25 30

Ile Ala Lys Lys Lys Gly Gly Met Arg Thr Ile Tyr His Pro Ser Ser
35 40 45

Lys Val Lys Leu Ile Gln Tyr Trp Leu Met Asn Asn Val Phe Ser Lys
50 55 60



Leu Pro Met His Asn Ala Ala Tyr Ala Phe Val Lys Asn Arg Ser Ile
65 70 75 80

Lys Ser Asn Ala Leu Leu His Ala Glu Ser Lys Asn Lys Tyr Tyr Val
85 90 95

Lys Ile Asp Leu Lys Asp Phe Phe Pro Ser Ile Lys Phe Thr Asp Phe
100 105 110

Glu Tyr Ala Phe Thr Arg Tyr Arg Asp Arg Ile Glu Phe Thr Thr Glu
115 120 125

Tyr Asp Lys Glu Leu Leu Gln Leu Ile Lys Thr Ile Cys Phe Ile Ser
130 135 140

Asp Ser Thr Leu Pro Ile Gly Phe Pro Thr Ser Pro Leu Ile Ala Asn
145 150 155 160



Phe Val Ala Arg Glu Leu Asp Glu Lys Leu Thr Gln Lys Leu Asn Ala
165 170 175

Ile Asp Lys Leu Asn Ala Thr Tyr Thr Arg Tyr Ala Asp Asp Ile Ile
180 185 190

Val Ser Thr Asn Met Lys Gly Ala Ser Lys Leu Ile Leu Asp Cys Phe
195 200 205

Lys Arg Thr Met Lys Glu Ile Gly Pro Asp Phe Lys Ile Asn Ile Lys
210 215 220

Lys Phe Lys Ile Cys Ser Ala Ser Gly Gly Ser Ile Val Val Thr Gly
225 230 235 240

Leu Lys Val Cys His Asp Phe His Ile Thr Leu His Arg Ser Met Lys
245 250 255

Asp Lys Ile Arg Leu His Leu Ser Leu Leu Ser Lys Gly Ile Leu Lys
260 265 270

Asp Glu Asp His Asn Lys Leu Ser Gly Tyr Ile Ala Tyr Ala Lys Asp
275 280 285

Ile Asp Pro His Phe Tyr Thr Lys Leu Asn Arg Lys Tyr Phe Gln Glu
290 295 300

Ile Lys Trp Ile Gln Asn Leu His Asn Lys Val Glu
305 310 315

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1602 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 548.. 1507 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

TGGCATCTAT TAAGAAGGTT AGGAAAGAAA ATAAAGTATC AAAAGATATT GGAAATATAT 60 

TATACGCAGA GCGTTTCTAT TGCCTTGTAT CTATTTACTG GATAGTGTCA ACTACCGCAC 120 

ACTGTGTGAA CTAGC^TTTA AAGCGATAAA GCAAGATGAT GTTTTATCTA AAATTATTGT 180 

TAGATCCGTT GTTTCTCGTC TAATAAATGA ACGAAAAATA CTTCAAATGA CTGATGGTTA 240 

TCAGGTCACT GC^TTGGGGG CTAGCTATGT TAGGAGCGTC TTTGATAGAA AGACACTTGA 300 

CCGATTGCGG CTTGAGATTA TGAATTTTGA AAACCGTAGA AAATCAACAT TTAACTATGA 360 

TAAGATTCCG TATGCGCACC CTTAGCGAGA GGTTTATCAT TAAGGTCAAC CTCTGGATGT 420 

TGTTTCGGCA TCCTGCATTG AATCTGAGTT ACTGTCTGTT TTCCTTGTTG GAACGGAGAG 480

CATCGCCTGA TGCTCTCCGA GCCAACCAGG AAACCCGTTT TTTCTGACGT AAGGGTGCGC 540 



AACTTTC ATG AAA TCC GCT GAA TAT TTG AAC ACT TTT AGA TTG AGA AAT 589
Met Lys Ser Ala Glu Tyr Leu Asn Thr Phe Arg Leu Arg Asn
1 5 10

CTC GGC CTA CCT GTC ATG AAC AAT TTG CAT GAC ATG TCT AAG GCG ACT 637
Leu Gly Leu Pro Val Met Asn Asn Leu His Asp Met Ser Lys Ala Thr
15 20 25 30

CGC ATA TCT GTT GAA ACA CTT CGG TTG TTA ATC TAT ACA GCT GAT TTT 685
Arg Ile Ser Val Glu Thr Leu Arg Leu Leu Ile Tyr Thr Ala Asp Phe
35 40 45

CGC TAT AGG ATC TAC ACT GTA GAA AAG AAA GGC CCA GAG AAG AGA ATG 733
Arg Tyr Arg Ile Tyr Thr Val Glu Lys Lys Gly Pro Glu Lys Arg Met
50 55 60

AGA ACC ATT TAC CAA CCT TCT CGA GAA CTT AAA GCC TTA CAA GGA TGG 781
Arg Thr Ile Tyr Gln Pro Ser Arg Glu Leu Lys Ala Leu Gln Gly Trp
65 70 75

GTT CTA CGT AAC ATT TTA GAT AAA CTG TCG TCA TCT CCT TTT TCT ATT 829
Val Leu Arg Asn Ile Leu Asp Lys Leu Ser Ser Ser Pro Phe Ser Ile
80 85 90

GGA TTT GAA AAG CAC CAA TCT ATT TTG AAT AAT GCT ACC CCG CAT ATT 877
Gly Phe Glu Lys His Gln Ser Ile Leu Asn Asn Ala Thr Pro His Ile
95 100 105 110


GGG GCA AAC TTT ATA CTG AAT ATT GAT TTG GAG GAT TTT TTC CCA AGT 925
Gly Ala Asn Phe Ile Leu Asn Ile Asp Leu Glu Asp Phe Phe Pro Ser
115 120 125

TTA ACT GCT AAC AAA GTT TTT GGA GTG TTC CAT TCT CTT GGT TAT AAT 973
Leu Thr Ala Asn Lys Val Phe Gly Val Phe His Ser Leu Gly Tyr Asn
130 135 140



CGA CTA ATA TCT TCA GTT TTG ACA AAA ATA TGT TGT TAT AAA AAT CTG 1021
Arg Leu Ile Ser Ser Val Leu Thr Lys Ile Cys Cys Tyr Lys Asn Leu
145 150 155

CTA CCA CAA GGT GCT CCA TCA TCA CCT AAA TTA GCT AAT CTA ATA TGT 1069
Leu Pro Gln Gly Ala Pro Ser Ser Pro Lys Leu Ala Asn Leu Ile Cys
160 165 170

TCT AAA CTT GAT TAT CGT ATT CAG GGT TAT GCA GGT AGT CGG GGC TTG 1117
Ser Lys Leu Asp Tyr Arg Ile Gln Gly Tyr Ala Gly Ser Arg Gly Leu
175 180 185 190

ATA TAT ACG AGA TAT GCC GAT GAT CTC ACC TTA TCT GCA CAG TCT ATG 1165
Ile Tyr Thr Arg Tyr Ala Asp Asp Leu Thr Leu Ser Ala Gln Ser Met
195 200 205



AAA AAG GTT GTT AAA GCA CGT GAT TTT TTA TTT TCT ATA ATC CCA AGT 1213
Lys Lys Val Val Lys Ala Arg Asp Phe Leu Phe Ser Ile Ile Pro Ser
210 215 220

GAA GGA TTG GTT ATT AAC TCA AAA AAA ACT TGT ATT AGT GGG CCT CGT 1261
Glu Gly Leu Val Ile Asn Ser Lys Lys Thr Cys Ile Ser Gly Pro Arg
225 230 235

AGT CAG AGG AAA GTT ACA GGT TTA GTT ATT TCA CAA GAG AAA GTT GGG 1309
Ser Gln Arg Lys Val Thr Gly Leu Val Ile Ser Gln Glu Lys Val Gly
240 245 250



ATA GGT AGA GAA AAA TAT AAA GAA ATT AGA GCA AAG ATA CAT CAT ATA 1357
Ile Gly Arg Glu Lys Tyr Lys Glu Ile Arg Ala Lys Ile His His Ile
255 260 265 270

TTT TGC GGT AAG TCT TCT GAG ATA GAA CAC GTT AGG GGA TGG TTG TCA 1405
Phe Cys Gly Lys Ser Ser Glu Ile Glu His Val Arg Gly Trp Leu Ser
275 280 285

TTT ATT TTA AGT GTG GAT TCA AAA AGC CAT AGG AGA TTA ATA ACT TAT 1453
Phe Ile Leu Ser Val Asp Ser Lys Ser His Arg Arg Leu Ile Thr Tyr
290 295 300

ATT AGC AAA TTA GAA AAA AAA TAT GGA AAG AAC CCT TTA AAT AAA GCG 1501
Ile Ser Lys Leu Glu Lys Lys Tyr Gly Lys Asn Pro Leu Asn Lys Ala
305 310 315

AAG ACC TAATGGTCTT CGTTTTAAAA CTAAAGCTCA TAGGTTGAAA 1557
Lys Thr
320

TTCTTCGTCC AACCAGTTAT TTAGTTCCTG CAATCGTTTC TGCAG 1602



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 396..1352



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 




TCACCCTGAA AGACCTGATT GCTTACCTGG AAGAGAAGCC GGAAATGGCG GAACATCTGG 60

CGGCGGTTAA GGCCTATCGC GAAGACTTCG GCGTTTAAAA ATATGCGCT6 TGCAGGGTTT 120

TTGCTGTGCG CAGCGTGATG CGC^CAAGA TATCGTGTTA ATCTGCTTTC GCCAGCAGTG 180

GCAATAGCGT TTCCGGCCTT TTGTGCCGGG AGGGTCGGCG AGTCGCTGAC TTAACGCCAG 240

TAGTATGTCC ATATACCCAA AXSTCGCTTCA TTGTACCTGA GTACGCTTCG CGTACGTCGC 300

GCTGACGCGC TCAGTACAG^ TACGCGCCTT CGGGATGGTT TAATGGTATT GCCGCTGTTG 360

GCGCCTCTTT TGGCCGCCGT GATGTGGAGA GTGGA ATG GAT GCT ACC CGG ACA 413
Met Asp Ala Thr Arg Thr
1 5



ACC CTT CTG GCG CTC GAT TTG TTC GGC TCG CCG GGC TGG AGC GCC GAT 461
Thr Leu Leu Ala Leu Asp Leu Phe Gly Ser Pro Gly Trp Ser Ala Asp
10 15 20

AAA GAA ATA CAG CGA CTG CAT GCG CTC AGT AAT CAT GCC GGA CGC CAT 509
Lys Glu Ile Gln Arg Leu His Ala Leu Ser Asn His Ala Gly Arg His
25 30 35



TAG CGA CGC ATT ATT CTT TCT AAA CGC CAC GGT GGT CAG CGG CTG GTG 557
Tyr Arg Arg Ile Ile Leu Ser Lys Arg His Gly Gly Gln Arg Leu Val
40 45 50 

TTA GCC CCT GAT TAG TTG CTG AAA AGG GTA GAG GGG AAG ATT GTT AAG 
Leu Ala Pro Asp Tyr Leu Leu Lys Thr Val Gln Arg Asn Ile Leu Lys
55 60 65 70



AAG GTG GTT TGA GAA TTT GGG GTT TGG GGT TTT GGT AGA GGG TAG (ZGA 
Asn Val Leu Ser Gln Phe Pro Leu Ser Pro Phe Ala Thr Ala Tyr Arg
75 80 



653 



GGA GGT TGG GGA ATG GTG AGG AAG GGG GAG GGA GAG TGG GAA/C^ GGG 
Pro Gly Cys Pro Ile Val Ser Asn Ala Gln Pro His Cys Gly Gln Pro
90 95 100



701 



GAG ATG GTG AAA GTG GAT ATG GAA AAG TTT TTG GAT AGC/^TT AGG TGG 
Gln Ile Leu Lys Leu Asp Ile Glu Asn Phe Phe Asp Ser Ile Ser Trp
105 110 



749 



TTA GAG GTG TGG GGT GTG TTT GGG GAG GGG GAG T 
Leu Gln Val Trp Arg Val Phe Arg Gln Ala Gln Leu Pro
120 125 



GGA GGT AAT GTG
Arg Asn Val



797 



GTA AGG ATG GTG AGG TGG ATT TGT TGT TAT AA^-^AG GGG TTA GGG GAG 
Val Thr Met Leu Thr Trp Ile Cys Cys Tyr Asn Asp Ala Leu Pro Gln
135 140 145 150



845 



GGG GGA GGA AGT TGG GGA GGG ATT TGG AAT GTT GTG ATG GGG GGT TTT 
Gly Ala Pro Thr Ser Pro Ala Ile Ser Asn Leu Val Met Arg Arg Phe
155 160 165



893 



GAT GAA GGG ATA GGG GAA TGG TGT GXG GGT GGG GGA ATT AGG TAG ACC 
Asp Glu Arg Ile Gly Glu Trp Cys Gln Ala Arg Gly Ile Thr Tyr Thr
170 175 180



941 



CGC TAG TGC GAT GAC ATG ACC T^T TGA GGT CAC TTG AAT GCC CGG CAG 
Arg Tyr Cys Asp Asp Met Thr Phe Ser Gly His Phe Asn Ala Arg Gln
185 190 195



989 



GTT AAA AAT AAA GTG TGC 
Val Lys Asn Lys Val Cys 
200 



TTG TTA GGG GAG CTG GGG GTG AGG GTC 
Leu Leu Ala Glu Leu Gly Leu Ser Leu 
210 



1037 



AAT AAA CGC AAA GGG TGC CTG ATA GGT GCC TGT AAG CGC GAG CAA GTA 
Asn Lys Arg Lys Gly Cys Leu Ile Ala Ala Cys Lys Arg Gln Gln Val
215 220 225 230



1085 



ACC GGG ATT GTT GT^ AAT CAC AAG CCA CAG CTT GCC GGT GAA GCG CGC 
Thr Gly Ile Val Val Asn His Lys Pro Gln Leu Ala Arg Glu Ala Arg
235 240 245



1133 



CGG GCG CTG CGy CAG GAG GTG CAT TTG TGC GAA AAA TAT GGG GTT ATT 
Arg Ala Leu Arg Gln Glu Val His Leu Cys Gln Lys Tyr Gly Val Ile
250 255 260



1181 



TGG CAT CTT AGT CAT GGT GGT GAA CTT GAT CCT TCT GGG GAT CTG CAC 
Ser His Leu Ser His Arg Gly Glu Leu Asp Pro Ser Gly Asp Leu His
265 270 275



1229 



GCA CAG GG*A AGG GCG TAT CTT TAT GCT TTG CAG GGA AGA ATA AAC TGG 
Ala Gln Ala Thr Ala Tyr Leu Tyr Ala Leu Gln Gly Arg Ile Asn Trp
280 285 290



1277 






TTA TTG CAA ATC AAC CCT GAG GAT GAG GCC TTT CAA CAG GCG AGA GAG 

Leu Leu Gln Ile Asn Pro Glu Asp Glu Ala Phe Gln Gln Ala Arg Glu
295 300 305 310 

AGT GTA AAG CGA ATG CTG GTT GCA TGG TAAGAAAAGC GTCAGGCAGA 

Ser Val Lys Arg Met Leu Val Ala Trp 
315 



CGTTTCTGCC TGACCGTTTA GGGGAGAATT ACTGCAACTG CGCGGCAATl 
GGGCGTCAAA ATCATCCGTC GGGCGGTATT TAAACTCGCT GCGGAC| 
TACCTTCACA GAAGGCCAGG ATCTGGCTTG CCAGCAGGGT TTC| 



^CGGCCAGC 



CGTGACAGCA 



/ 



1372 

1432 
1492 
1540 



